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■ Abstract 

(N ■ 

Several aspects of regularity theory for parabolic systems are investigated under the effect of random 
perturbations. The deterministic theory, when strict parabolicity is assumed, presents both classes of 
systems where all weak solutions are in fact more regular, and examples of systems with weak solutions 
which develop singularities in finite time. Our main result is the extension of a regularity result due 
' to Kalita to the stochastic case. Concerning the examples with singular solutions (outside the setting 

of Kalita's regularity result), we do not know whether stochastic noise may prevent the emergence of 
singularities, as it happens for easier PDEs. We can only prove that, for a linear stochastic parabolic 
Qh ' system with coefficients outside the previous regularity theory, the expected value of the solution is not 

singular. 

! MSC (2010): 60H15, 35B65, 35R60 (primary); 60H30 (secondary) 

1 Introduction 

Nonlinear parabolic systems of the form 

> 

^sD ' d t u = div A(x, t, u, Du), u\ t =o = u (1-1) 

(N ' 

■ on a cylindrical domain D x (0,T), with Dcl"a bounded, regular domain, u : D x [0, T] — > R , A: D x 
[0, T] x R N x M. nN — > R nJV , have been investigated by many authors, see for instance [14, 15, 13] and references 

00 ■ therein. A key feature in the vectorial case N > 1 is that, under the strict parabolicity assumption 
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and some differentiability assumptions with respect to the (x, u)-variablc, there are classes of vector fields 
A(x, t, u, z) such that all weak solutions to (1.1) are in fact more regular, and examples of systems such that 
there exist weak solutions with singularities; this dichotomy does not happen for single equations, the case 
N = 1, where regularity of weak solutions is always true, due to the (elliptic and parabolic) works based on 
the fundamental results of De Giorgi, Nash and Moser [4, 19, 18]. 

However, for second-order, parabolic systems under suitable additional assumptions on growth and regu- 
larity of the vector field A{x, t, u, z), there are partial regularity results available, yielding Holder regularity 
of the solution u (or of its spatial gradient Du) outside of a negligible set, the singular set of u (or of Du). 
Hence, for general systems which are nonlinear in the gradient variable, the best regularity to hope for is 
partial regularity of Du, with an estimate for the Hausdorff dimension of the singular set strictly below the 
dimension of R™ x [0, T], see [7]. For better estimates on the Hausdorff dimension one needs to assume 
stronger assumptions (or also some a priori information on the regularity of the solution) . Regularity of u on 
a larger set can be obtained, for instance, in low dimensions or with special structure assumptions (such as 
vector fields linear in the gradient variable), see e.g. [12, 2, 20]. Full regularity of u instead is only possible 
if even more restrictive structural assumptions are imposed. The easiest (and very classical one) of such 
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1 Introduction 



examples are linear parabolic systems with constant coefficients. In the nonlinear case, in the famous case of 
the p-Laplacian system it is also possible to prove full regularity of Du, see [6]. Furthermore, if the system 
is still sufficiently close to the Laplacian system, then we still get full regularity of u, see [14, 13]. This 
regularity result will be of great importance for our paper. 

Let us now go into some details, point out some of the structural prerequisites of the positive (full) 
regularity theory and confront it with the existing examples of systems admitting a singular weak solutions. 
For simplicity we focus here in the introduction on the case of quasilinear problems with a vector held of the 
form A(x, t)z, i.e. to weak solutions of 

d t u = div (A(x, t) Du) , u\t=o=UQ. (1.2) 

As mentioned above, without additional structural conditions on the coefficients full regularity of the 
solutions can no longer be expected in the vectorial case. It was observed by Koshelev and Kalita [14, 13] 
that if the coupling of the single equations is sufficiently weak, then discontinuities of the weak solution can 
globally be excluded: 

Theorem 1.1 ([13]). Let uq G W 1,g (D, R ) for some q > n and consider coefficients A(x,t) which are of 
class C 1 in x, measurable in t and which satisfy 

A |e 2 < (A(x,t)£,{), \A(x,t)Z\<\i\£\, and \D x A(x, t)\ < L (1.3) 

for all £ G R nN , (x, t,z) G D x [0, T] x K nAr and some positive constants \q,\i,L. If in addition ^ > 1 — ^ 
holds, then every weak solution u : D x [0, T] — > M. N to the initial boundary value problem (1.2) is of class 
C%£(D x [0,T],R N ) for some a > 0. 

It is important to mention that the original results apply to more general systems, possibly nonlinear in 
the gradient variable, provided that the vector field A(x, t, u, z) is sufficiently close to a quasilinear situation 
with a small dispersion ratio. First, Koshelev proved the existence of a regular solution (which in the situation 
above is the unique one) by studying an approximation of the system such that its solutions are regular and 
converge in a suitable norm to a solution of the original system. Later Kalita achieved the regularity result 
for all solutions with a direct argument (and not as a consequence of a suitable approximating sequence), 
which essentially relies on Moser's iterative method [18]. 

Under weaker assumptions than in the previous theorem, such a global regularity result can no longer 
be expected. In fact, in a very similar setting the following example of a system was proposed by Stara and 
John [26] (actually, the example was constructed on the full space and the solution can be traced back in 
time t —> -co), which admits a solutions that starts from a regular - in particular Holder continuous - initial 
data and develops a singularity in finite time in the interior of the parabolic cylinder. 

Theorem 1.2 ([26]). Let n = N > 3. There exist initial data uo G W 1,2n {Bi{Q), W 1 ) and measureable, 
symmetric coefficients A G L°°(Bi(0) x [0, 1), R n xn ), which are elliptic and bounded in the sense of (1.3)i 2 
for all (x,t) G -Bi(O) x [0,1), such that at least one of the solutions to the initial problem (1.2) develops a 
discontinuity in the origin x = as t /*■ 1 . 

The coefficients constructed in [26] have a dispersion ratio ^ < 1 — ^ below the critical one investigated 
in [13] as well as a lower regularity with respect to the a;-variable. For this reason we cannot exclude that 
the solution develops the discontinuity due to an interaction with the irregular coefficients. However, a 
comparison with the positive regularity results in the elliptic theory (the stationary case) suggests that the 
dispersion ratio ^ plays an important role. Indeed, in the elliptic case no regularity in the x-variable is 
required, and a modification of De Giorgi's counterexample to full regularity shows sharpness of the (elliptic) 
condition on the dispersion ratio (see [15, Section 2.5]). Unfortunately, we didn't find further counterexamples 
in the literature which could give a similar complete picture in the case of parabolic systems. 

The aim of this paper is to investigate parts of this theory under the effect of random perturbations. 
The final aim of our research project, in analogy with recent results proved for other equations, is to show 
that the regularity theory of parabolic systems is, under random perturbations, in some sense (of course 
only up to a certain degree of regularity) not worse than the deterministic one, and possibly better. As in 
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the deterministic case there is more than one approach to the analysis of these problems, so we restrict the 
attention here only to a few directions. More precisely, we want to show two results. 
First, we study systems with ltd noise of the form 

du = div (A(x, t) Du) dt + H (Du) dB t , u\ t =o = u a (1.4) 

(for H Lipschitz) where [B t )t>o is a Brownian motion of suitable dimension, and we generalize to the 
stochastic case one of the regularity results of the deterministic theory, the work of Kalita [13] (which was 
displayed also above for the special case of a quasilinear system) . The passage from deterministic to stochastic 
of Kalita's approach contains at least one non trivial detail which is rather new in the stochastic setting: the 
weak solution u we start with is not, a priori, the limit of a sequence of smooth solutions of approximating 
equations (for instance, due to the nonlinearity, classical mollifiers are difficult to implement; in another 
direction, in some cases solutions exist as limits of Galerkin or other types of approximations, but we here 
start with a weak solution which a priori has not been constructed in that way). Therefore, it is not clear how 
to perform differential calculus on u\ and Kalita's approach is hcuristically based on an equation satisfied by 
second space derivatives of u. Therefore one has to use finite difference quotients in place of derivatives, a 
classical method in the deterministic setting, but not common in the stochastic case. This leads to a number 
of technical novelties. At the end, we reach a full extension of Kalita result to a quite general stochastic case, 
which includes in particular perturbations in form of additive or of multiplicative noise. In the quasi-linear 
model case we obtain - as a particular case of Theorem 5.1 - the following result (the precise definition of 
weak solution is given in Definition 2.2 below). 

Theorem 1.3. Let uq G W l q (D, R ) for some q > n. Consider coefficients A(x,t) which are of class C 1 
in x, measurable in t and which satisfy (1.3) with ^ > 1 — ^, and assume that H is Lipschitz continuous 
with Lipschitz constant Lh < L* H for some sufficiently small L* H depending only on n, Xq and Ai . Then there 
exists a > depending only on n, Ao, Ai and q such that every weak solution u: D x [0, T] x O — > 1* to the 
initial boundary value problem (1.4) is of class Cy'°~(D x [OjTJjR^) with probability 1. 

Second, we investigate for these systems the problem recently considered for other classes of PDEs, see [8], 
namely the possibility that it is precisely the noise which prevents the emergence of singularities. The aim 
of this project, that we have reached only partially until now, would be to prove that, under assumptions on 
the vector field A(x, t, u, z) such that there exist weak solutions with singularities in the deterministic case, 
there arc no more singularities if we add a suitable noise. We can only prove an intermediate but promising 
result. We consider linear stochastic systems with Stratonovich bilinear multiplicative noise of the form 

du = div (A(x, t)Du) dt + aDu o dB t , w|t=o = u o (1-5) 

with regular, bounded and elliptic measurable coefficients A. In this situation we obtain regularity for the 
mean value. 

Proposition 1.4. Given coefficients A(x,t) which are of class C 1 in x, measurable in t, and which sat- 
isfy (1.3), there exists Co > depending only on Ao,Ai such that for all a > (To, all initial conditions 
uo G W 1,q (D, Wi N ) with q > n, and all weak solutions u of equation (1.5) we have that the function 
(x,i) i — > E[u(x,t)} is locally Holder continuous on D x [0,T]. 

This result is proved by applying the deterministic results. The key observation in the proof is that the 
average solves an equation with a better (that is greater) dispersion ratio. So far this class does not cover the 
counter-examples in [26] mentioned above (because here we need more regularity of A than in the counter- 
example), but however the theory presented here might be of its own interest. We cannot take cto = but 
we suspect that this is the critical value (namely that for all noise intensities the regularization takes place) . 
The fact that this result holds independently of the initial condition u$ - though sufficiently regular - and 
of the specific form of A(x, t) in the functional class we consider (in particular the fact that no symmetry 
is embodied in our assumptions which could justify compensations due to the expected value), leads us to 
think that in fact u(x,t) itself is Holder continuous, but we do not have a proof of this conjecture. 

Several problems remain open: 



4 



2 Setting and assumptions 



(i) whether counter-examples exist also in the stochastic case under some conditions on A; 

(ii) the regularity of u(x, t) itself in the rcgularization-by-noise above and other related issues, such as the 
value of do and a generalization to the nonlinear case; 

(iii) the generalization of other deterministic approaches to regularity. 

Concerning the existence of weak solutions, we could give a quite general result, but since it is related to 
the generalization of the approach of [14, 15] to regularity, we postpone it to a future work. 

2 Setting and assumptions 

Consider n, n' G N with n > 2, T > 0, and D C R" a (regular) bounded domain. Let (SI, 5F, P) be a complete 
probability space with a filtration (9 r t ) t >o, and let (B t )t>o be a standard n'-dimensional Brownian motion. 
Let further A : D x [0, T] x R N x R nN x SI — > R nN be a vector field satisfying the following properties: 

• A is progressively measurable, i. e. for every t G [0, T] the restriction of A to D x [0, t] x M. N x M. nN x SI — > 
M n7V is 23(D) x 23([0,i]) x SQR^) x 23(M nJV ) x J t measurable; 

• A(a;, i, u, z, w) (usually abbreviated by A(x,t,u, z)) is differentiable in x,u and z (with 3Vadapted 
derivatives), and it satisfies for P-almost all w G SI the following assumptions concerning growth and 
cllipticity: 

\A(x,t,u,z)\ <L(\z\ + \uf¥ +/*(i,t)) 

2r)C| 2 < (l-^ 2 )l^| 2 
|D^4(x,t,u,a)|<L(|2r|^ + |u|'s:+/(a;,t)) 
t \D x A(x,t,u,z)\ <L(\z\ + \u\^ + f 2 {x,t)) 

for all (a;, t) £ D x [0,T], u S K w and z,^ 6 R nAr , some constants k,u,L > 0, and an ^-adapted 
process / which with probability one belongs to L a (D x [0, T}) for a fixed number a > n + 2. 

Moreover, let H : D x [0,T] x R n7V x SI — > K™ w be progressively measurable, of class C 1 in x, Lipschitz 
with respect to the gradient variable of at most linear growth, uniformly in (x,t), i.e. 

H(x, t, z,u>) — H (x, t, z,lu)\ < Lh \z — J\ , 

H(x,t,z,uj)\ < L(f B (x,t,u) + \z\), (2.2) 
D x H(x,t,z,u)\ <L(ff 1 (x,t,Lj) + \z\) 

for a constant L B , all (x,t) eflx [0, T], z,z € M. nN , and almost every u> € f2. Here, denotes a function 
in L a (D x (0,T) x SI). 

Under these assumptions we consider a stochastic partial differential equation with noise of the form 

du = divA(x,t,u,Du)dt + H(x,t,Du)dB t in D T := D x (0, T) , (2.3) 

where u : £>t x SI — > M w is a random function. The stochastic integral is here understood in the Ito sense. 
According to the growth condition on the vector field A, we note that for P-almost every w £ SI and all 
t G [0,T] we have div A(x,t,v, Dv) G W'^iD,^) - the dual space to Wq' 2 (D, R n ) -, provided that 
v G W 1 - 2 (D,R N ). 

Remark 2.1. We have chosen this level of generality of the noise for two reasons: to keep a simple PDE 
structure instead of an abstract operator formulation, and to cover two interesting examples: additive noise 
(with H(x, z) independent of z ) and bilinear multiplicative noise with first derivatives of u (with H(x, z) linear 
in z). A priori there is no conceptual obstacle to consider H depending also on u and also to generalize to 
the case of a Brownian motion B in a Hilbert space U , with suitable assumptions on H , but for simplicity 
we restrict ourselves to the previous case. 
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The function spaces that will be needed in the sequel are the Banach spaces 



V m ' p (D T> WL N ) := L°°(0,T;L m (D,R N )) nL p (0,T;PU 
V™> P (D T ,R N ) := L°°(0,T;L m (D,R N )) nL p (0,T; W ( 



hp (D 7 R N )) 



with m,p > 1, and they are equipped with the norm 



\\u\\v'".p(d t ,r") ■= esssup||u(i)|| L m (D!R .v) + \\Du\\ LP{DTtRN) . 
te(o,T) 



When m — p we shall use the abbreviations V p ^(Dt,R n ) = V P q P (Dt,R n )- We remind that the spaces 
V m ' p (D T ,R N ) are embedding in the Lcbesgue space Li(D T ,R N ) with q = p'-^ > p (see [6, Proposi- 
tions 1.3.1, 1.3.2]). We will need only the result concerning the cases m = 2, p > 2 or m = p > 2. In the 
latter case, the embedding reads as follows (see [6, Propositions 1.3.3, 1.3.4]): let v G Vq(Dt,R n ), p < n. 
Then there exists a constant c depending only on n and p such that 



(and an analogous result holds without any restriction on the boundary values of v on dD x (0, T) if dD is 
assumed to be sufficiently regular). 

We are now going to study the properties of weak (or variational) solutions to the system (2.3), which is 
to be understood in the following sense. 

Definition 2.2. An 9^ -progressively measurable process u on [0,T] x is called a weak solution to the 
system (2.3) with initial values uo G L 2 (D 7 R N ) if P-a. e. path satisfies u(-,uj) G V 2 (Dt,R n ) and if for all 
t G [0,T], we have P-a.s. the identity 



When a solution is progressively measurable with respect to the (completed) filtration associated to the 
Brownian motion, it is usually called a "strong" solution in the probabilistic sense, sec [23, Section IX. 1]. We 
do not require this condition, so our result will also apply to the so called "weak" solutions in the probabilistic 
sense (those for which there is a filtration (9 r t)t>o such that u is ^-progressively measurable and B is an 
3VBrownian motion). We further note that according to the definition above, a solution is defined as an 
equivalence class in the sense of versions (a process Y is a version or modification of a process X if for each 
time t we have P-a. s. X t = Y t ). Hence, regularity of a weak solution is always to be understood as finding 
a regular representative in the corresponding equivalence class. 

Moreover, we comment on the way in which the initial values are attained. Under mild assumptions on 
the growth of A and H with respect to the gradient variable one actually deduces from the equation itself 
that u belongs to C°(0, T; L 2 (D', M. N )) P-a.s. for every D' <s D, compare formula (4.1) and the beginning 
of Step 3 on p. 15. Under further assumptions on the trace of u on dD x [0, T] this extends to continuity of 
the full L 2 -norm, with D' = D. In this sense the term "initial value" in the definition of a weak solution as 
a function in the space V 2 is justified. 

3 Preliminaries 

In this section we recall some well-known facts and provide some technical tools. For convenience of the 
reader we state two suitable versions of Ito's formula. Furthermore, in analogy with the deterministic theory, 
we discuss a sufficient condition for the "existence of weak derivatives with probability one" , and we further 
give a criterion which guarantees pathwise Holder continuity. 



\ v \\li(D t M n ) < c \\ v \\vp(D t M n ) 



(2.4) 




for all ip G Wq' 2 (D,R n ). 
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3.1 Ito formula 

We first recall two versions of Ito's formula, the first one the standard version for TV-dimensional processes 
and the second one for processes with values in Hilbert spaces. Consider (f2, F, P) a complete probability 
space and let 



be an iV-dimensional Ito process which satisfies: a, b are 3Vadapted (i-e., the maps uj t— > a(t,Lj),b(t,u>) are 
3t measurable), (t, oj) i— > b(t,uj) is 23([0,T]) x ^-measurable and 



Then the following general Ito formula holds (see e.g. [21, Theorem 4.2.1]). 

Theorem 3.1 (Ito's formula I). Let g(t, z) = (gi(t, z), . . . , g p (t, z)) be a map from [0, T] x 1* to W of class 
C 1 in t and of class C 2 in z. Then the process Y(t,uj) := g(t,X(t)) with X(t) defined in (3.1) is again an 
ltd process whose components are given by 



for all k £ {1, . . . ,p\, and with dBi dBj = 8^ dt and dBi dt = = dtdBi for all i,j S {1, . . . , N}. 

In the sequel, we will also employ the following version of the Ito formula in Hilbert spaces that can be 
found in [16, Theorem 3.1] or [24, Chapter 4.2, Theorem 2]. 

Theorem 3.2 (Ito's formula II). Let V C H C V be a Gelfand triple, with H a separable Hilbert space. 
Assume that we have for L 1 x P almost all (t,w) 



for every ip E V where x(t,ui),y(t,uj) are taking values in V and V , respectively, and are progressively 
measurable with 



dX(t) = a(t) dt + b(t)dB t 



(3.1) 




,( t ) = ^(t,X)dt+T t ^(t,X)dX i + -y i -^L(t,X)dX l dX 3 
dt ^ d Vl 2 dyiyj 




(3.2) 




and where M t is a continuous local martingale with values in H . Then there exists a set Q C O with -P(O) = 1 
and a map x(t, uj) with values in H such that: 



(i) x(t) is Jt-adapted, continuous in t £ [0,T] for every w£(l, and x(t) 



x(t) P-almost surely; 



(ii) for every uj £ fl and every ip £ V there holds 




(Hi) for every u> £ f2 there holds the inequality 



m\\ 2 H 




o Jo 
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3.2 Weak derivatives 

For a vector- valued function /: R™ D D — > R N . k G {1, . . ,,n} and a real number h G R we denote by 
Ak t hf(x) := h~ 1 (f(x + hek) — /(x)) the finite different quotient in direction and stepsize /i (this makes 
sense as long as x, x + /iefc G D). Let p > 1, / G LP (D), k G {1, n} and let be the derivative of / in 
the direction k in the sense of distributions. Just for comparison, let us recall the following lemma (not used 
below). 

Lemma 3.3. If there is h n — > and gk G L P (D) such that 



for every ip G Cq°(D), then Dkf is in L P (D) and is equal to gk- 

As an immediate consequence of this lemma and of the compactness of the L p -spaces with p > 1 with 
respect to weak (or weak-*) convergence, we obtain a simple criterion for the existence of the weak derivative 
Dkf in IP ', namely it is sufficient that || Ak^hfWLpfD') is bounded by some constant, for all h and every D 1 <s D 
such that \h\ < dist{D',dD). 

Now this well-known principle shall be carried over to a probabilistic setting. Let (Q, F, P) be a complete 
probability space and consider a function / in the Banach space L P (D x f2). A function gk G LP(D x fi) is 
said to be the weak derivative of / in the fc-direction if 



for every ip G C^°(D) (taking a countable sequence and using a density argument, the property "for every 
ip G C^°(D)" can be written inside the probability). We then write Dkf = gk- The previous lemma has a 
generalization to functions in IP(D x CI). 

Lemma 3.4. If there is h n — > and gk G L P (D x O) such that 



for every tp G C£°(D) and every bounded measurable X : — > R, £/ien Dkf is in IP(D x O) and is egua/ to 



Proof. Since X and y> are bounded, we may apply (first Fubini and then) Lcbcsgue's dominated convergence 
theorem, and we get 






(A /£ ./ in /(x,o;) - g k (x,w))<p{x)X{uS) dxdP(u>) = 




When h n < dist(spt (p, dD), this is equal to (we apply Fubini twice and a change of variables) 




This limit is zero by assumption, hence 




The arbitrariness of X implies J D (fDk(p+gk<p) dx = 0, as a random variable on ft. The proof is complete. □ 
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Corollary 3.5. If there is a constant C > such that 



E 



|A fcih /(x)| p dx 



< C 



for all h and all D' <s D such that \h\ < dist(£)', dD), then D k f is in L P (D x ft). 

Proof. The family g k ,h(x,uj) := A k ^f{x,uj) is equiboundcd in L P (D' x S7), hence there is a sequence h n — > 
such that <7fc h„ converges weakly in L p (D x 17) to some function g k G L p (D x S7). The product </?(x)X(u;) is 
in L p (D x 17) (p' conjugate to p) for every ip G C£°(D) and every bounded measurable X : SI — » R. Hence, 
we may apply the lemma and obtain the assertion. □ 

First, for our later application, we replace D by D x [0,T] and we allow different integrability exponents 
with respect to the variables in [0,T] and D, respectively. Let / : — > L q (0, T; L P (D)) be a measurable 
function with p G (1, oo) and g > 1. We say that a function <?& : 17 — s> L 9 (0, T; L P (D)) is weak derivative of / 
in the /c-direction with probability one if for a. e. (t, u>) G [0, T] x 17 we have 

f D k (fidx = - I gkfdx 

J D 

for every <p G Cq°(D), and we then write D k f = g k . Furthermore, let us generalize to a scheme where we 
relax the integrability in 17. 

Theorem 3.6. Let Y : [0, T] x SI — > (0,1] be a positive random variable, with P(inf te r 0i T] Y > 0) = 1. If 
there is a constant C > such that 



E 



\Y(t)A k , h J(x,t)\\] \ LV 



(0,T;Lp(D')) 



< c 



for all h and D' <s D satisfying \h\ < dist(Z7', dD), then Dkf G L q (Q,T; L P (D)) with probability one and 
there hold 

YA k , h f -»• YD k f weakly m L P (Q; L q (0, T; L P (D))), 
E [\\ YD *f\\Uo,T-,L* m }^ C 

with the same constant C . 

Proof. The family Z k , h (x,t,uj) := Y(t)A k , hn f(x,t,uj) is equibounded in L p (17; L q (0, T; L P (D'))), hence there 
is a sequence h n such that 2/- ) / ln converges weakly in L p (fl; L q (0,T; L P (D))) (or weakly-* if q = oo) 
to some function Z k G L p (17; L q (0,T; L P (D))). This implies (again with X bounded, measurable and </? 
smooth, compactly supported) 



n— >oo 



lim / / / (y(t)Afc, hn /(aj,t)-Z fc (x,t))^(a?)^(t)Xda?£ftdP = 0. 

'Dx[0,T]xfi 



Hence, by Fubini and change of variables as above, we find 



Tl— >OC 



lim / / / (y(t)/(a:,t)A Jk ,_ /ln ^(a:) + Zfc(a: I t) V j(a;))^(t)Xda:dt<iP = 1 

L>x[0,T]xf2 



which in turn implies by Lebesgue's theorem 

(Y{t)f(x,t) D k ip(x) + Z k (x,t) (p(x))ip(t)X dxdtdP = 0. 



'£>x[0,T]xfi 

Arbitrariness of X and ip thus yields 



D 



{Y(t)f(x, t) D k <p(x) + Z k (x, t) <p(x)) dx = 
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for a. e. (t, u) € [0, T] x 17. Therefore, we have 



D 



(f{x, t) D k (p(x) + g k {x, t) <p{x)) dx = 



(3.3) 



fora.c. (t,u) G [0,T]xQ, where g k = Y- x Z k . Since Z k belongs to L p (fl; L q (0, T; L P (D))), it is L q (0, T; L P (D)) 
for P-a. c. to € CI. Hence, by assumption on Y, we also have g k G L q (0, T; L P (D)) for P-a. e. to 6 J7. The 
only difference with the definition of g k being the "weak derivative of / in the /c-direction with probability 
one" is that the negligible set of (t,uj) G [0,T] x fi where (3.3) may fail depends on ip G C^°(D), until now. 
But W 1,p (D) (p' is conjugate to p) is separable and C§°(D) is dense in it. Hence, there is a countable 
family {(f„} C C^°(D) which is dense in W 1,p (D). If we call TV the countable union of all negligible sets of 
(t,uj) G [0,T] x fl where (3.3) may fail for {f n }, N is negligible, and on the complementary we have (3.3) 
for every ip n , hence by density for all tp G W 1,p (D) and then for all tp G C^°(D). Having identified g k as 
the weak derivative of / in the fc-direction we take advantage of the lower semi-continuity of the norm with 
respect to weak (or weak-*) convergence and thus find 



E[\\YD k f\\l 



Li(0,T-Lv(D))\ 



E[\\Zk\\L"{0,T;LP{D))] < C. 



The proof is complete. 



□ 



Remark 3.7. This result will be applied later in the cases p = q where the assumption then reads as 

rT 



E 



\Y{t)A k Mx,t)\ p dxdt 



< C 



10 JD 

and where we have L P (VL] L q (0, T; L P (D))) = L P (D x [0, T] x fl), or in the case q = oo where we then require 

E\ sup f \Y(t)A kth f 2 (x,t)\ p dx\ <C. 

l te(0,T)JD J 

From the theorem we then conclude that D k f\ G L P (D x [0, T]) and D k ,f2 G L°°(0, T; L P (D)) with probability 
one, respectively. In particular, if we take a function f G W ,P (D) and if the previous assumptions are 
satisfied for f\ = Df and $2 = f, then the conclusions are equivalent to D k f G V p (Dt). 



3.3 A criterion for pathwise Holder continuity 

We next discuss a criterion which guarantees Holder continuity of (a suitable representative of) a given 
functions u : D x [0, T] — > M. N . For example, Sobolev's embedding theorem provides a criterion easy to apply 
if u is in a suitable Sobolev space W 1,q (D x [0, T], M. N ) - but which in general is not satisfied for the solutions 
considered in our paper since derivatives in time need not exist. Instead, we now prove that it is sufficient 
that only the spatial derivatives belong to a suitable Lebesgue space, provided that a weak form of continuity 
in time (i. e. of the L? (P)-norm) is available. 

Lemma 3.8. // a function u: D x [0,T] — » 1SL N has the properties 

Du G P°°(0, T; L n+a {D, R nN )), u G C^O, T; L 2 {D, R N )) 
for some a, j3 > 0, D C M n a bounded, regular domain, then 

u G C(D x [0,r|,R^) 
for some 7 > 0, depending only on a, j3 and n. 

Proof. First, we deduce spatial Holder continuity for every time slice. From the assumption Du G L°°(0, T; L n ~ 
we deduce u G L°°(0, T; C S (D)) for some <5 > 0, depending only on a and n, by Sobolev's embedding theorem. 
Namely, there exists C\ > such that 

\u(x,t)-u(y 1 t)\<C 1 \x-y\ S (3.4) 
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for all t E [0,T], x,y G D. 

Our next aim is Holder continuity in time, at a fixed point. From the inequality 

\\u(;t)- U (;s)\\ LHD) <C 2 \t-sf 

for s,t G [0, T], we infer for every set B C D 

m£ B \u(x,t) - u(x,s)\ < ^J^\u(x,t) - u(x,s)\dx < * \\u(-,t) - u(-, s)\\ L 2( D) < ^JL^L. 

Let xo G D be given. In order to prove Holder continuity in time at xo, we estimate 

|u(a;o, t) — u(xq, s)\ < \u(xo, t) — u(x, t)\ + \u(x, t) — u(x, s)| + \u(x, s) — u(xo, s)\ 
< 2C\ \x — xq\ S + \u(x, t) — u(x, s)| 

for every x G D. Hence, if we take a; in a ball B(x$, p), we have 

|u(xo, t) — u(xa, s)\ < 2C\p s + inf \u(x,t) — u(x, s)\ 

xEB(x ,p) 

where C3 is such that \B(xo,p)\ = /5 n /C|. Let us now choose p = \t — s\ e for some e > 0: 

\u(x , t) - u(x ,s)\ < 2d \t - sf + C 3 C 2 \t - sf~ en/2 . 

If we choose for instance e = /3/n, we get 

\u(xo,t) — u(xq, s)\ < Ci\t — s\ v (3-5) 

for some ij, C4 > 0, independently of xq G D, t, s G [0, T}. The constant 77 depends only on j3, 8 and n. 

From (3.4) and (3.5) it is now straightforward to deduce the claim of the lemma. □ 

With the previous lemma at hand, we now give a criterion in the probabilistic setting, with (0, F, P) a 
complete probability space, which is adapted to weak solutions. 

Proposition 3.9. Let u : D x [0, T] x Q — > R nN have the properties 

P(Du G L°°{0, T; L n+£ (D, R nN ))) = 1 (3.6) 



u(x, t) = Uq(x) + / a(x,s)ds+ / b(x,s)dB s 
Jo Jo 

for some e > 0, uq G L 2 (D), and with progressively measurable fields a, b such that 

P( / I \a(x,s)\ 2 dxds+ f ( I |6(x,s)| 2 d2;) 2 ds < 00) = 1. 



J_D JO ^JD 

Then 

P(u G C~<(D x [0,T])) = 1 

for some 7 > depending only on e. 

Proof. Step 1. If we prove that, for some j3 > 0, 

P( U e^(0,T;I 2 (D))) =1, 
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then we get the claim of the proposition after the pathwise application of the previous Lemma 3.8 (using in 
particular the stated independence of the Holder exponent). To this end we observe that the function u is 
the sum of two terms: 



u\(x, t) = uq(x) + / a(x,s)ds, u 2 (x,t) = I b(x,s)dB s 
Jo Jo 

The term m is, with probability one, of class W /1 ' 2 (0, T; L 2 (D)), hence it is of class C 1 / 2 (0, T; L 2 (D)): 



\\ui(t) - mi(s)|U2 (£) ) = / a(-,r)dr 



L 2 (D) 



<\t-s 



1/2 



T 

J D 



a(x, r)\ 2 dx drj 



1/2 



So it only remains to prove that, for some /? > 0, 

P(u 2 eC p (0,T;L 2 (D))) = 1. 

Step 2. For R > 0, let 



t r = inf {t e (0, T] : J \\b(-, s)\\ 2 +l D) ds > R} 



if the set is non empty, otherwise tr = T. Let Q,r C ft be the set where tr = T. The family {Qr} R>a is 
increasing, with 

p( (J "«) = 1 

R>0 



because by assumption we have P(Jq \\b(-, s) H^t^) ds < oo) = 1. We now set 

b R (x,s) = b(x,s)l s < TR and u 2 , R (t) = / b R (x,s)dB s -- 

Jo 

We then have 



tAr H 



b(x, s) dB s 



\\b R (;S,U))\\ 2 L i s (D) ds<R 

uniformly in ui. Hence, for every p > 1, we find 



E 



\\u2, R {t) - u 2 ,r{s)\\ p l 



L 2 (D) 



= E 
< C V E 



b R (-,r) dB r 



L 2 (D) 



\\ b R(-, r )\\h(D) dr 



< CJt ~ s\^+^E 



\b R {-,r)\\ 2 +l D) dr 



< C v R^\t - sl^+^T. 



This implies, for p = p(s) sufficiently large, by Kolmogorov's regularity theorem for processes taking values 
in L 2 {D) (see [3, Theorem 3.3] for a version in Banach spaces), that u 2 r has a Holder continuous version in 
L 2 (D) 

\\u 2 , R {-,t,u) -u 2: r{-,s,uj)\\ l2{d} < Cp >R (u})\t - sf 
with /? any Holder exponent with /? < 2/2+e) • ^ or w £ ^i? we thus have (recalling the definition of u 2R ) 

\\u 2 (-,t,w) - u 2 (-,s,w)|| i2(£)) < Cp )H (a;)|t - sf. 

Since Ui?>o ^-R * s °f -P-measure, we obtain u 2 £ (0,T; L 2 (D)) for P-a. e. w 6 U. Now the previous 
Lemma 3.8 can be applied, and the proof is complete. □ 
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3.4 A technical lemma 

In Kalita's paper a crucial point is to show higher regularity (such as higher integrability and differentiability) 
not only for the solution, but also for powers of the solution (resp. its gradient). For this purpose the following 
technical lemma was essential. 



Lemma 3.10 ([13]). Let u: W 1 —¥ M. N be a function which is a. e. differentiable. Set v = u\u\ s with 

t s \ 



s G (— 1, oo). Then, for /i(s) := 1 — (of— ) 2 > we have a. e. 



Du-Dv> //2(s) \Du\ \Dv\ . 

We need the following modification of this result, which on the one hand allows to test the system with 
powers (truncated for large values) and which on the other hand satisfies an estimate corresponding to the 
one from Lemma 3.10. 

Lemma 3.11. For every K > and every q>l there exists a C 2 '-function T Qi k- K + — > M + such that 

(i) T q K is strictly increasing and convex on R + , and it satisfies T qt x(t) = t 2q for all t < K; 

(ii) for all t 6 M + and a constant c(q) the growth with respect to t is estimated by 

T q ,K(t)+T^ K (t)t + T^[ K (t)t 2 <c(q) min{A^- 2 i 2 ,^}; 

moreover, the inequalities T q ' K (t)t - T' q K {t) < 2(q - l)T' q K {t) as well as T' q ' K {t)t 2 < c(q)T' q K {t)t < 
c(q)Tq t K{t) hold true on R + ; 

(Hi) If u: M" — > l w is a function which is a. e. differentiable and fi(q) := 1 — (^^-) 2 , then for the function 
v = T' q K (\u\)\u\~ 1 u the following inequality is satisfied a. e.: 





Du-Dv> y/jI(4j\Du\\Dv\ > vW) T «,ir(H) M" 1 \ Du \ 2 ■ 
Proof. We first assume K = 1. We set 

T M-f ^ ' lit ^ 1 
at 2 + bt + c if t > 1 

for some coefficients a, b, c € M. to be determined as follows. The C 2 -regularity condition implies that the 
following linear system has to be satisfied: 

q(2q-l) 
-43(3-1) 
,1 -3<7 + 2g 2 , 

We now calculate some crucial quantities. We first observe that 

, f 2qt 2 «- 1 if *< 1 

i> l{ ) \ 2q(2q - l)t - Aq(q - 1) if t > 1 

is strictly increasing and positive on M + . Thus, we immediately obtain assertion (i) of the lemma. Further- 
more, we have 

T" (t\t-T> It) - / 4 <?(<? - l)* 29 " 1 « * < 1 

U,iWt 43(3-1) if £ > 1, 

which is again positive on R + . Moreover, for all t E M + wc obtain 

T^(t)t-T qA (t)<2(q-l)T qA (t), (3.7) 
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which in particular yields the inequality T ql {t)t 2 < c{q)T' q i(t)t of assertion (ii). The last inequality T' q i(t)t < 
c(q)T q _i(t) is also checked easily. For the function v given in (iii) we next compute 

DiV a = T' q>1 {\u\)^ + (T>' A (\u\)\u\ - T q i(\u\)) Di 



\u\ 3 



Du-Dv = r qA (\u\)^- + {T^(\u\)\u\ - T qA (\u\)) 1 -^^. 

In particular, this shows Du ■ Dv > 0, using again the positivity of T ql (t)t — T' q 1 {t) and of T' q ^t) on M + . 
Furthermore, we obtain 

\Dv\* = T^(\u\) 2 1M + (T^(\u\)\u\ - T q<1 (\u\)) 2 

+ 2T; 1 (H) (T^(\u\)\u\ - T qA (\u\)) i^j^ > T qA (\u\f , 

which yields the second inequality in (iii). Now, keeping in mind the definition of ii(-), we find via the 
previous estimate (3.7) 

'<Z-1\2 , „ „ 2 I^| 4 , (rr,n n m , m n \Du ■ 



\Du ■ Dv\ 2 M (ff) \Du\ 2 \Dv\ 2 = («— ) T> A (\u\) 2 + (T^(\u\)\u\ - T qA (\u\)Y 

<rr" r\ hi I r p l (\ i^ 2 2g- 1 |£>m-m| 2 |£»m| 2 

- ^ yi 

— ) 2 ni(M) (^i(N)i«i - km) |gu - u|2 4 |j3 " |a 

?-l\ 2 , /, ,x 2 l^«| 4 , (rr,,, n m , ^ n ,v»a 1^ " «| 4 



> (V) T '- l(|u|)2 k + W U >)M - ^.iW) 

- 2 «Zi ^ fl(H ) (T^(|u|)H - ^(H)) , 

which is non- negative by Young's inequality. This finishes the proof of (iii) for the case K = 1. To complete 
the proof of the lemma it is sufficient to observe that for general K > the coefficients a, b, c have to be 
replaced by aK 2q ~ 2 , bK 2 " 1 " 1 , cK 2q , and the conclusion then follows exactly as above. □ 



4 Higher differentiability of weak solutions 

In this section we start working on the solution u of the parabolic system (2.3). First, we prove an upper 
bound for the average of weighted norms of Du. This will be done in Section 4.1 and serves also to explain the 
general strategy to obtain such estimates. We will then extract higher regularity properties of the solution, 
still following the ideas given in Section 4.1. More precisely, as final result of this section, we are interested in 
pathwisc higher intcgrability of the gradient Du, which will be the core of the proof of the regularity result 
given in Theorem 5.1. 

4.1 An a priori estimate 

From Definition 2.2 of a weak solution u to (2.3), no a priori information is available on the expected value 
of the solution. In particular, we only know that u(u) belongs to the space V 2 (Dt,^ n ) for P-almost every 
<jj G f2, but it is still possible that the average -E[||u||y2(£) T r jvj] is infinite. Even if this cannot be excluded, 
we can win an a priori information on the average of weighted norms of u. 

The strategy for a priori estimates for deterministic elliptic of parabolic systems is simply to "test" the 
equation with the solution (or some modification of it), and an estimate then follows by employing the 
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regularity and growth properties of the system. For stochastic systems, testing with an the appropriate 
version is replaced by the application of an Ito's formula for Banach spaces. Then, a first pathwise estimate 
follows (Step 1). Since we are interested in averages, the first estimate is rewritten (Step 2) by introducing 
weights depending on the solution itself. With these weights we can finally take the expectation (Step 3) and 
end up with the desired estimate, which we now state in its precise form. 



Lemma 4.1. Let u £ V 2 (Dt,^ n ) be a weak solution to the initial boundary value problem to (2.3) under 
the assumptions (2.1) 12 , (2.2) 12 and with u( - , 0) = u (-) G L 2 (D,WL n ). Suppose further that the smallness 
condition L 2 H < 2k~ 1 (1 — (1 — v 2 ) 1 / 2 ) is satisfied, and let Dq C D with d a := dist(D ,dD) > 0. Then there 
holds 

fT 



E 



e-S^ G ^ ds \\Du{t)\\l, {DQ) dt 



<c {\\uo\\ 2 L 2 (D) + l + E[ 



fs\\LS(D a 



for a constant cq depending only on D, L, Lh, do, k and v, and a function Go(u, f) given by (4.3). 



Proof. Step 1. A preliminary pathwise estimate. We start by multiplying the equation (2.3) with a standard 
cut-off function 77 6 C°°(D, [0, 1]) which satisfies i] = 1 on Dq and \Drf\ < c(do). Obviously, the map nAk.hU 
has the same properties concerning intcgrability and mcasurability as u, and the Ito formula from Theorem 3.2 
in Banach spaces may be applied with the Gelfand triple W^ 2 (D,R N ) C L 2 (D,R N ) C W~ 1 ' 2 (D, R N ). This 
yields the existence of a subset 11' C U of full measure P(£l') = 1 and a function u' : [0,T] — > W 1 ' 2 (D,R N ) 
which satisfies: u' is ^-adapted on [0,T] x il', continuous in t for every u 6 fl', and u 1 = un holds for 
P x ^-almost all (t,uj) 6 [0,T] x Vt. Moreover, using the integration by parts formula, we have for every 



t 

2^ 



\W(t)\\ 2 L2{D) +2 / (D(u( S )r 1 2 ),A(-, S ,u( S ),Du(s))) L 2 {D) ds 
Jo 

= IM£ 2(D) +2 / (u'(s) 7], H{-, s, Du(s)) dB s )l 2 (D) + I \\H(; S ,Du( S ))n\\ 2 L2{D) ds (4.1) 



(with the convention \M\ 2 = ^ or evei T nxn matrix). Next we need to estimate the second 

integral on the left-hand side of the previous identity, employing the assumptions (2.1). For this purpose, we 
first observe with (2.1)1,2 and Young's inequality that 

(Du(s) T] 2 ,A(-,s, u(s),Du(s)) ) L °-(D) 



\ Du{s) rj 2 , D Z A{ ■ , s, u(s),rDu(s)) Du(s) )l*{d) dr + ( Du(s) r] 2 ,A{- ,s, u(s), 0) )l*(d) 



> 



-(1 - (1 - ^ e) \\Du(s) V f L2(D) c{e-\L) (Wu^f^ + \\f(s)\\ a La{D) 

rt L n (D) 



for all s S (0, T) and every e > 0. Moreover, we find 



\(u(s) ® Dnn,A(- ,s,u{s),Du(s))) L 2 (D) \ 

< -\\Du(s)r,\\l 3(D) +c{L,e-\K,d Q ) (\\u{s)\\ 2 L2[D) + \\u(s)\\^ 2) + \\f(s)\\ a La(D) ) . 

K L n (D) 

Next, the integrand of the last term on the right-hand side of (4.1) is bounded via (2.2)i ; 2 by 
\\H(-, s, Du(s)) n\\ 2 L2(D) < (L% + -) \\Du(s) V \\ 2 L2(D) + c^" 1 , k) \\f H ( S )\\h {D) . 



Combining the last three inequalities (here enters the smallness assumption on Lh) with (4.1), choosing 
e sufficiently small (in dependency of Lh and v) and using Holder's inequality, we thus end up with the 
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announced pathwise estimate 

lK(*)llia(r>) +c~ 1 (L h ,k,u) / \\Du(s)r]\\ 2 L 2 iD} ds 

Jo 

< hovWhm) + 2 / (u'(s)r),H(-,s,Du(s))dB s ) L 2 {D) 
Jo 

+ co(D,L,L H ,do,K,v) I (1 + \Hs)f£ 2 , +\\m\\ a La(D) + \\fH(s)\\h (D) )ds. (4.2) 

JO L n (D) 

Step 2. An improved pathwise estimate. The next step consists in getting a pathwise estimate where 
the bound on the right-hand side contains a deterministic part almost independent of the weak solution and 
the function /, and a stochastic part which might still depend on the solution. We start by defining 

Go(u I /)(a) = l + Ka)||^ a) + ||/(*)||J„ (D) (4.3) 

for s £ (0, T). Obviously, Go belongs to L 1 (0,T) with probability one. Then we use a Gronwall-type 
argument, by applying the one-dimensional Ito-formula to exp(— J cqGo(u, f)(s) ds) (1 + ||w'(£)||^2(£,p! as 
e.g. in [25, Proof of Theorem 5.1]. Thus, we get 

e" X ds \\u'(t)\\ 2 LHD) + c-\L Hi k, v) f e- K ^ (u,f)(s) dq Du{s) ^ ds 

Jo 

< HO) v\\h (D) + 1 + 2 f e- f» ^G [u,f){S) ds ( u , {s) ??) H ^ g) Du{g)) dBs )l2{d) 
Jo 

+ co f e-f» c ° G °^^ ds \\f H ( S )\\l 2{D) d S . (4.4) 
Jo 

Note that we here have omitted a negative term which appeared on the right-hand side and the positive term 
containing the 1 on the left-hand side. This is the desired improved pathwise estimate. We note that u and 
/ still appear in the function Go in the deterministic integral on the right-hand side, but in a way that for 
greater values of u or / the integral gets smaller. At the same time obviously also the exponential factor on 
the left-hand side will get smaller, but this allows us now to pass to Step 3. 

Step 3. An estimate for the expected value with weights. Uniform estimates for the average of the weak 
solution (e.g. for expressions of the form i£[||u||] for some norm of u or Du) of course cannot be expected 
under such weak assumptions as we have supposed in the lemma. But the previous inequality (4.4) now 
allows us to get a weighted inequality, with no stochastic terms on the right-hand side. Since the expectation 
of the stochastic integral is not a priori known to vanish, we now apply a stopping time argument. 

From identity (4.1) it follows that the process Hu^t)^/^ has a continuous version in t, used in the 
following argument. For every R > we introduce the random time 

tr := inf {t £ [0,T]: J* \\u' ( S )\\ 2 L2(D) \\H(s, Du( S ))r,\\ 2 L2{D) ds > r] 

with tr = T when the set is empty. We note that ||u'(s)||^ 2 / i - ) J|/f(s, Du(s))r?||^ 2 / i - ) ^ is in L 1 (0,T) with 
probability one, because of the property u £ V 2 (Dt,^- N ) and the assumption (2.2)2 on H. Hence, we have 
in particular PQiniR^oo tr = T) = 1 and 

P( lim \\u'{thTR)\\ 2 L2[D) - \\u'{t)\\l, {D) ) =1 
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for every t 6 [0, T]. Now we take inequality (4.4) at time t A tr and get 

rtf\T R 



■/„ ^^M^^Wu'itAT^Wli^+c- 1 / Wo CoGo ^ f ^ ds \\Du(s)r ] \\l 2iD) ds 



< 1^(0)77111,^ + 1 + 2 

rtAr R 



tAr R 



e" # C ° G °^)« di ( u '(«) r/, (., s, Du(«)) dB s ) L2(£)) 



Co 



■/;«*Go(u,/)(g)««|| /H(fl) ||2 ds _ 



Now we have 

ftf\TR 



i) 



and 



e -2^c„ G oK/)( S -)^ ls ^ H || u / (s) ||2 || S(fljDu(a)) „|| 



< 



tATR 



ds 



n 



||« , (a)||i a(D) ||J?(a ) i?«( a ))»y||l a(i3) d S <i? 



by definition of t#. Thus the stopped stochastic integral above is a martingale, hence with expected value 
zero. This implies 



E 



■ So ATR c G {uJ) (s)ds \\ U 'U ^ 



t R)\\l2(D) 



+ E 


-7 




Jo 



tATR 



e - / S c Go(«,/) «^||£) M ( S ) ,,|j2 2(D) rf s 



< HA^/jitor?!!^^) + 1 + c E 



\\ 2 {D) ^ 



On the left-hand-side we now apply Fatou's lemma to the first term and the monotone convergence theorem 
to the second one, and we get 



E 



e ~ So coGo(uJ) {s)ds || u '(£-)||2 



L 2 (D) 



+ E 


v 7 




Jo 



e - So coG («,/) (s)ds^ Du ^ rjf ds 



□ 



<\\uov\\ 2 mD)+^ + coE / \\f H (s)\\h {D) ds 

L Jo 

for every t G [0, T]. This proves the bound claimed by the lemma. 
4.2 Existence of second space derivatives 

We next study the existence of second order space derivatives. For deterministic elliptic and parabolic partial 
differential equations it is a standard procedure to establish the existence of higher order derivatives by finite 
difference quotients methods. The basic idea in the deterministic case is the following. Once the norm of 
finite difference quotients of Du are kept under control independently of its step size, i. e. || ^Du\\lp < C 
with C independent of h and with p £ (1, oo), then the weak derivative D^Du exists and has finite norm in 
IP (and as long as one is away from the boundary also the reverse it true). So uniformly bounded difference 
quotients of Du can heuristically be considered as second derivatives D^Du. This uniform bound in turn is 
usually achieved by "testing the system" with appropriate modifications of the solutions (formally one might 
think of Ak,-h^k,hu) and relies on the one hand on the ellipticity of the vector field A and on the other 
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hand on its regularity with respect to the x and u variables (we note that it seems mandatory to have at 
least Lipschitz- regularity in order to expect the existence of full second space derivatives) . 

For the stochastic perturbed system (2.3) the approach for proving the existence of higher order deriva- 
tives is still very similar, but we need some modifications due to the stochastic terms. The above strategy 
(with testing replaced by the the use of the Ito formula in Banach spaces) applied to our stochastic system 
gives - after some standard, though very technical computations - a preliminary pathwisc estimate for finite 
difference quotients of u and Du (this corresponds in some sense to Step 1 in the proof of the previous 
Lemma 4.1). But since this estimate still involves a stochastic integral, it is not yet possible to gain immedi- 
ately any information on second order derivatives. In a second step, this pathwise estimate is rewritten (here 
again some Gronwall-typc inequality is needed), which allows in the third step to take the expectation of a 
weighted version of ||Afc,^£)u||i2 and to bound it independently of the stepsize h. This is still sufficient to 
deduce the existence of D k Du with probability one (see Section 3.2). 

Given a deterministic initial condition uo (sufficiently regular) we now give the precise statement on the 
boundedness of the expectation of finite difference quotients of Du. 

Lemma 4.2. Let u £ V 2 (Dt, R ) be a weak solution to the initial boundary value problem to (2.3) under 
the assumptions (2.1), (2.2) and with u( ■ , 0) = uo(-) £ W 1,2 (D,R N ) . Suppose further that the smallness 
condition L 2 H < 2k~ 1 (1 - (1 - v 2 ) 1 ' 2 ) is satisfied, and let D' C D with d' := dist(£)', 3D) > 0. Then there 
holds 

sup E 

\h\<d> 



SUP e-/oVC(«,/)«ta|| A (t) ||2 + / e -Sio'G'(uJ) d s llDAkMm 2 dt 

*G(0,T) Jo 

< c' (\\D k u \\ 2 L2{D) + 1 + E[\\fp\\ 2 L2(DT) ]) 



for every k £ {1, . . . , n}, a constant c' depending only on n, D, T, L, Lh, d', K, and v , and a function G'(u, f) 
given by (4.7) further below. 

Proof. We here proceed similarly to the proof of Lemma 4.1, with the main difference that instead of u we 
now need to estimate the difference quotients A^^u. 

Step 1. We first observe that if u is a solution of (2.3), then for all t £ [0,T] by definition also the 
following identity holds true for P-a. s.: 

(77 Afc,fcu(t) - 7]A kM u ,(p }l2(d) = / (v divAfc )h A(-,s, u(s),Du{s)) 1 ip) w ^ 1 , 2{D) . w i.2 {D) ds 



/ {<p,n A k>h H(-, s, Du(s))) dB s ) L 2 [D) 
Jo 



for all if £ Wo' (D, R N ). Here k £ {1, ... ,71} is arbitrary, h £ R with \h\ < d', and for sets D' C D C D 
we denote by 77 £ C OC (D , [0, 1]) a standard cut-off function satisfying 77 = 1 on D' and \Drf\ < c(d'). 
Therefore, the map "qA^^u has the same properties concerning integrability and measurability as u, and 
the Ito formula from Theorem 3.2 in Banach spaces is again applied with the Gelfand triple W ' (D,M. N ) C 
L 2 (D,R N ) C W-^AR^)- We hence get a subset Q,' C fl of full measure P(fi') = 1 and a function 
u' k : [0,T] — > W 1,2 (D, R N ) with the following properties: u' k is Jt-adapted on [0,T] x fl', continuous in t for 
every ui £ fi', and satisfies u' k = Ak,hurj for P x £ 1 -almost all (t, oS) £ [0, T] x ft. Moreover, for every ui £ CI' 
we have 

lK(*)lli»(D)+2 / ( D(Ak.hu(s) i] 2 ), Ak,h.A( ■ , s, u{s),Du{s)) )l 2 (d) ds 
Jo 

= ||A fclh «o»/||i 2(i , ) +2 / (u' k (s) V ,A k . h H(-,s,Du(s))dB s ) L 2 {D) 



+ / \\A k>h H(;s,Du(s))r,\\ 2 L2{D) ds. (4.5) 
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4 Higher differentiability of weak solutions 



Our first aim is to deduce a pathwisc estimate for finite differences of u and Du, respectively. To this end 
we first study in detail the second term on the left-hand side. For almost every t £ [0, T] we decompose the 
finite difference quotient applied on A(x, t, u, Du) as follows 

A kih A(x, t, u(x), Du(x)) 

= h [A(x + he k , t, u(x + he k ), Du(x + hek)) — A(x + hek,t, u(x + hek), Du(x))] 
+ h [A(x + he^, t, u(x + hek), Du(x)) — A(x + e^, t, u(x), Du(x))] 
+ h [A(x + hek, t, u(x), Du{x)) — A{x, t, u(x), Du(x))] 

= / D z A(x + hek,t,u(x + hek), Du(x) + r h Ak,hDu(x)) dr Ak,hDu(x) 
Jo 

D u A(x + he k , t, u(x) + rh A kth u(x), Du(x)) dr A kJl u(x) 

l 



D x A(x + r hek, t, u(x), Du(x)) dr 

=:A(h)+'B(h) + e(h) (4.6) 

with the obvious abbreviations. Using the assumptions (2.1), Holder's and Young's inequality, we now 
estimate the different terms arising from this decomposition in equation (4.5) on time slices t 6 (0,T) (on 
such slices we omit the notion of t). We first find for almost every t € (0, T) and every e > 

(D(A fc:/lU ? 7 2 ),yi(/i)) i 2 (D) 

= K~ 1 (D(Ak,hur) 2 ),DA kM u) L 2 iD) - k~ 1 ( D(A kM u rj 2 ), DA kth u - KA{h)) L 2 (D) 

> K^WDAk^wnf^^ - 2e \\DA k ,huri\\h(D) ~ c(n,e) \\A k , h uDr)\\ 2 L2 ( D) 
-^{l-v 2 )^ \\DA k)h ufi\^ {D) 

> { K -Hl - (1 - v 2 ) 1 -) - 2e) \\DA k . h u V \\ 2 L2(D) - c( K ,e, \\D V \\ L ~ {D) ) \\D k u\\ 2 L2(D) 

where in the last line we have used the fact that the norm of the finite difference quotient of a compactly 
supported function is always bounded by the norm of the partial derivative (provided that the stepsize is 
sufficiently small). This lower bound will be crucial (and can be understood as some cllipticity of the vector 
field A up to lower order terms). We next observe with the Sobolev-Poincare embedding (applied on every 
time-slice) 

|<£(A MlU 77 2 ),S(/i)) i2(D) | 

<L(\\DA k j l uT 1 \\ L 2 {D} +2\\Akj l uDT 1 \\ L 2 {D} ) \\A k , h ur)\\ e ,„ 



x WAk.HuvW^f D) (\\Du\\" +2 2n +\\u\\- 2 



-2(D) 

Lt(.D) 



< {WDAkMUvWlt^ + \\Ak, h uDr ] \\ 1 L + 6 {D) ) 

x e{n,D,L) ||A fc , hU „||k ( % {\\Du\\^ ^ + HI*, (J>) + \\f\\ L f {D} ) 

for every 9 £ (0, 1); in the two-dimensional case n = 2, -j^^ shall be interpreted as any arbitrary number 
greater than 2. We note that we have omitted the step of passing from the shifted to the original domain in 
the first inequality and we have applied cd e + c e d < c 1+e + d 1+e for all c, d > to get from the first to the 
second inequality. To estimate further we choose 9 = according to the integrability assumptions of u 
(using the embedding given in (2.4)), Du and /. Thus, Young's inequality gives 

I ( D(A k j l u rj 2 ), H(h) )l 2 (d) \ 

< s\\DA kth ur]\\ 2 L2(D) +c(n,D,L, \\Dri\\ L ^ {D) ,e) (||A fclh u»/||| 3(D) + l) 
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x (IM^ + HlfL + \\f\\li% m ) ■ 

L re (D) 

Finally, using Young's inequality and standard properties of finite difference quotients, we estimate the last 
term involving G(h) by 

<L(\\DA k , h ur]\\ LHD) +2\\A k , h uDri\\ L 2 {D) )(\\Du\\ L 2 {D) + \\u\\ 2 % ±21 + \\f\\l HD) ) 

L n (D) 

< e \\DA kth nv\\ 2 L 2 (D) + c(D,L, \\Dr,\\ L ~ {D) , e) (l + \\Du\\ 2 L2{D) + \\u\\^ 3) + \\f\\ n L t\ D) ) . 

L n (D) 

Now we have estimated all terms coming from the integral involving A k h,A(x, t, u, Du). Next we study the 
last integral in equation (4.5). Employing the properties (2.2), we find 

\\A kJl H(s, Du(s)) vWlhd) < H/if 3 0) + \Du(s)\ n\W{D) + L H \\DA k . h u{s) rj\\v m . 

Before summarizing the previous estimates for the single terms, we introduce, for ease of notation, the 
function 

G'(u,f)(s) := 1 + \\Du(s)\\ 2 L2(D) + Hs)||^ 2) + \\f(s)\\ a La(D) (4.7) 

L n (D) 

with s € (0,T), which belongs to L 1 (0,T) almost surely. Note that by definition we have G'(u,f) > G (u,/) 
with Gq(u, f) denoting the function introduced in Lemma 4.1. Combining the latter estimates with the 
decomposition given in (4.6), using standard properties for finite difference quotients and choosing e = 
e(k, v 1 Lh) sufficiently small, we hence infer from (4.5) that for every weff there holds 



K(*)lli»(£>)+c 1 {L H ,n,v) \\DA kth u{s)r)\\ 2 L2{D) ds 

Jo 

< \\A k , h u(0) V \\ 2 LHD) +c' [ (||A fclh «(*) V \\l 2{D) + 1) G'(u, f)(s) ds 

Jo 

+ 2 I (u k (s)r),A kth H(;s,Du(s))dB s ) LHD) +2 [ \\f£* (s)\\ 2 L2{D) ds , (4.8) 



and the constant c' depends only on n, D, L, Lh, d' , k and v. Here we assume c' > Cq with Cq denoting the 
constant given in Lemma 4.1. This is the preliminary pathwise estimate on the finite difference quotients 
(which however involves the stochastic integral) and concludes the Step 1. 

Step 2. Before passing to the expectation value as described in the beginning we still need the Gronwall- 
type argument, similarly as in the proof of Lemma 4.1. However, we here observe that the second integral on 
the right-hand side is in general not known to be finite, but the first factor of the integrand "almost" happens 
to appear in the sum of its left-hand side (in the sense that u' k differs from A k j L u rj only on a negligible set). 
Hence, to get rid of this possibly uncontrollable term we apply the one-dimensional Ito-formula (recalling 
a > n + 2), and we obtain 



e~ J ° C ' G '^^ dS K(t) \\l HD) +C- 1 e- So c'o'iujm ds llDAkMs) ^2 dg 

Jo 

< J feSS c'G'(u,fXs )d s G , (uJ) (\\A k , h u(s) V f L2{D) ||u' fc (*)||i a(D) ) ds 

JO 

+ || A fcih «(0) v\\h {D} + 1 + 2 / e-So cf& («,/)(*-) ds ( u , k (s) ^ Ak hH ^ Sj Du(s) ) dBs ) L2{D] 

Jo 

■r^ c ' G ' {uJ){s)dS \\ff"(s)\\ 2 LHD) d s . (4.9) 
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Here, we again note that the average of the first integral on the right-hand side vanishes, due to the fact that 
u' k and Ak^hUf] coincide on [0, T] x f2 except for a set of L 1 x P-measure zero. 

Step 3. We now derive the estimate for the average with weights. In contrast to Lemma 4.1, we now 
derive in a first step an estimate for the (weighted) average of the £ 2 (L 2 )-norm of DAk.hU (which proceeds 
in exactly the same way as before). Then, we use this estimate to get also an upper bound for the (weighted) 
average of the L°°(L 2 )-norm of Ak.hU. 

Wc first note that identity (4.5) implies that the process 1 1 zt^. (t) 1 1 ^ 2 fz?) nas a continuous version in t. Now, 
for every R > 0, we introduce the random time 

TR:=inf{ie[0,T]: J K(s)||| 2(ZJ) \\A k>h H{s, Du{s))r,\\ 2 L , {D) ds > B,} 
with Tpt = T when the set is empty. Notice that 

\\ u 'k( s )\\ 2 L2 { D) \\ A k,hH{s,Du(s))r)f L2{D) ds < oo 

with probability one, because of the property u £ V 2 (Dt,M. n ) and the assumption (2.2) 2 on H. Hence, 
when R — > oo, tr is eventually equal to T, with probability one. In particular, P(limi?^oo tr = T) = 1 and 
for every t G [0, T] we have 



p( R hmJK(i Atr)!!^) = K(t)||i a(D) ) = i 



Step 3a. We compute inequality (4.9) at time t Atr and get 

e~ R c '°'^ «*K(t A tr)\\I hd) + c- 1 / e" K ^'(u,f) (^ l]DAkhu{s) ^ ds 

Jo 

Pt/\TR 

< c' / e -/;^'WX^G'(n,/) (\\&kMs)v\\h (D) - \\u' k ( S )\\h {D) ) ds 
Jo 

rtAT R 

+ \\A k , h u v\\h iD) + 1 + 2 e-fo C 'GW) (S)ds (^(s)^ A k , h H(s, Du(s)) dB s ) L , {D) 



o 



tATR 



+ 2 / e-/o'^W)(s)«|| / .- a(fl) ||a 



Now wc have 

rt/\T R 



,- % CG'(uJ) (I) rfg ( ^ (s)j?! Afc ^ (Sj £>„(„)) dBs )i2(D) 

- /• c'G'K/) (5) d« 1s ^ r{ u / fc(s)7?) A fe ^lf (s, £>„(«)) dP s ) £a(D) 



and 

' t e - 2 /o*-'«'(«./)0«Bl 5 < TJl || u ' fc ( s )||| _JA fe ,,P( S ,Pm(.s)H| 2 



tAT R 

\\ui(s)\\l 2{D) \\A k . h H(s : Du(s)) V \\l 2(D) ds < R 



< 



by definition of tr. Thus, the stopped stochastic integral above is a martingale, hence with expected value 
zero. This implies (using that u' k equals Ak^ur] outside a set of C 1 x P-measure zero) 



E 



tA-r _ i r r tATR 

-/„ flcG >./)(^||<(tAr fl )|| 2 2(D) l +c- 1 J e[ / e-J?^(u I /)(S)«B|| £ , Afcihtl(a) ^||2 a(D)dlJ 

2 



< \\A k , hUo r,\\i 2{D) +l + 2E / ||.^- 2 ( S )||i 2(I?) d S 
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We apply Fatou's lemma to the first term and monotone convergence theorem to the second one on the 
left-hand-side, and we get 



E 



e -/ VGW) (S)dS|| u / fc ^|| 



h(D) 



c~ l E 



e- ^ c ' G ' {uJ) {s)ds \\DA k , h u( S ) rj\\ 2 L2{D) ds 



<\\A k , h u 0V \\i HD) +l + 2E / \\f a H 2 { S )\\l HD) d S 



for every t £ [0, T]. This proves one of the two bounds claimed by the Lemma. 

Step 3b. It is almost our final estimate except that we need the supremum in time inside the first 
expected value, and thus we have to repeat the previous computations by means of martingale inequalities. 
The previous estimate (as well as the a priori estimate from Lemma 4.1) will be used in the next one; we 
found it convenient to proceed in two steps. From the stopped inequality above we have 



E 



sup e 
te[o,T] 



f ATR c'G'(u,f)(S)dSn,.l 



u k (tAr R )\\i HD) 



<\\A k , h u r 1 \\l HD) + l + 2E / \\r H 2 {s)\\l 2(D) ds 



2E 



sup 

*e[o,T] 



- /• c'G'(uJ) (.5) ds ^ ( ^ (s)??) Ak>hH (g) Du{s))dBs) 



L 2 (D) 



We apply again Fatou's lemma to the expected value on the left-hand-side. The last term on the right-hand- 
side is estimated, by means of the Burkholdcr-Davis-Gundy inequality, by 



CE 



-2f° c'G'(u,f)(s)ds 

e *-S<Tr 

T 



\\u' k {s)\\l 2{D) \\A Kh H{s,Du{s))^ 2 



L 2 (D) 



1/2 



= CE 
< CE 



,-2/, " c'G'(uJ)(S) ds 







><r s \W k (s At r )\\ l2(d) \\A kth H(s,Du(s))r]f L2 



(D 



1/2. 



rl/2 7-1/2 
1 l 1 2 



l_r.-, C 2 



where 



<^E[h]+—E[h] 



h = sup e~fr^'o'(uj ){S )^ K{tATR)r 
te[o,T] 



e -f>c'G'(u,f) (S)<K||A fcih fr(s,Du(*)) J7 || 



l 2 {d) ds- 



Hence, we have proved 
1 



E[l 1 ]<\\A k , hUo r 1 f L2(D) + l + 2E / \\fH- 2 ( S )\\ 2 LHD) d s 



C 2 



E[h). 



From the estimate above for \\A k .hH(s, Du(s))rj\\ 2 ^ 2 r D - ) and the estimate of Step 3a, we know that E\l2\ is 
bounded from above via 



E[I 2 ] <4E 



Jo 



L 2 (D. 



ds 



+ c(||A fcih uo*7lli a (i,) + l + 



E 



\L 2 (D) 



ds 



With a suitable choice of Dq (in dependency of D and D') and keeping in mind c'G'(u,f) > coGo(u, /) by 
construction, the first average on the right-hand side of the last inequality is bounded due to Lemma 4.1. 
Hence we get the bound for E[Ii] as asserted in the statement of the lemma. Since h was arbitrary and 77 = 1 
on D\ the proof is complete. □ 
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4 Higher differentiability of weak solutions 



Remark 4.3. For SPDEs having first space derivatives of the solution in the coefficient of the noise, the 
most general condition for existence of solutions in L? , which becomes also a condition for an improvement 
of W k ' 2 -regularity, is more precise than just the control on the Lipschitz constant of H expressed by the 
statement of Lemma 4-2; see [22, 16]. However, when we go to W k,p -regularity with p > 2, the computations 
are too involved and the algebraic simplicity of the condition of [22, 16] seems to be lost. For this reason we 
have simplified the estimate also for p = 2. 

Applying Theorem 3.6 with (p, q) = (2, 2) and (p, q) = (2, oo) and summing over k G {1, . . . , n) we then 
infer from the previous lemma that second order spatial derivatives of u exist almost surely. We should note 
that this result does not extend up to the boundary of D since the constant c' blows up for dist(D', D) \ 0, 
but the result holds on any fixed subset D' (e D. 

Corollary 4.4. Let u G V 2 (Dt,R N ) be a weak solution under the assumptions of the Lemma ^.2. Then 
there holds Du £ V 2 (D' T ,M. N ) with probability one, and 



E 



sup e-ti c ' G ' {u >n ds \\Du\\ 
te(o,T) 



h(D') 



e-fi c ' G '^ ds \\D 2 u\\l HDr) dt 

c(\\Duo\\h {D) + l + E[\\ff?\\ 2 L2{DT) }) 



< 



for the constant d from Lemma 4-2. Moreover, we have for all k G {1, . . . , n} 

e -| /* c'G'(uj)ds Ak hDu e -| /„« c'G'(uj)ds DkDu weaMy m L 2( D < T x ft) 



4.3 Iteration 

In the next step we want to iterate the procedure from the previous section, in a way such that we do not 
only know the spacial gradient Du to belong to the space V 2 with probability one, but that we get this result 
also for certain powers of Du. For convenience we introduce the function 

Wq : R k -S- K defined by W q {Cj := 

for every q > 0. We start by briefly describing the strategy how this regularity improvement is achieved. 
First we observe from the results in Section 4.2 that there exists a subset of f2 of full measure on which Du 
belongs to V^ c (Dt), hence we can now take advantage of higher integrability properties for u and Du. This 
shall be done with the following (but formal) iteration scheme: 

W qj (Du) eV 2 — > Du E L 2q ^ and u £ L 2q ^^ 2 n L°°{L 2q ^) 

— > D u A(x,t,u,Du) g L mia ^ i1 ^'^ and D x A(x,t,u, Du) G L min ^ q ^'^ 
— ► W qj+1 (Du)eV 2 

for a sequence {qj}j^jq of numbers qj > 1 for all j G N. The first implication indeed follows from the Sobolev's 
embedding for the space V 2 ' p , the second one from the growth conditions on the vector field A, the third one 
from the iteration (and a convergence result concerning finite difference quotients). After a finite number of 
steps we then arrive at a final (maximal) higher integrability exponent, which essentially reflects how close 
the vector field A is to the Laplace system. This should be understood in the following sense: the closer v is 
to one (note that v = 1 corresponds to the case A(x,t, u, z) = z plus potential lower order terms), the more 
integrability for Du can be gained in the iteration and the better will be the final regularity properties of u. 
Finally, we note that in every step of the iteration we will have to reduce the radius of the parabolic cylinder 
and we will also have to restrict ourselves to smaller subsets of O. Nevertheless, the higher integrability 
results will always be true on sets of probability one. 

We now start with some preliminary remarks and consider again the equation (2.3) 



du = div A(x,t,u, Du) dt + H(x,t, Du) dBt 
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in Dt- We observe that div A(x, t, u, Du) is well defined in view of the regularity assumptions (2.1) and the 
existence of second order spatial derivatives, see Corollary 4.4. More precisely, it is easy to check that for 
every weak solution u G Vq(Dt,^- N ) we have: div A(x, t, u, Du) £ L 2 oc (D', M. N ) with probability one, and 
the equation above holds for £™-almost every x G D' for L 1 x P-almost all (t,uj) £ (0,T) x ft. Hence, we 
can now work immediately with this equation without passing to its weak formulation. 
In the next lemma we will provide the main step of the iteration argument: 

Lemma 4.5. Let u G V 2 (Dx, M. N ) be a weak solution to the initial boundary value problem to (2.3) under 
the assumptions (2.1), (2.2) and with initial values u(- ,0) = Uq(') S W 1,2 (D, M. ), and assume that 



E 



\Y p PW p (Du)f 

L n {D' T ) 



< C p < oo 



for some p > 1, a set D' C D, and Y p : [0, T] x Q — > (0, 1] given by Y p (t, u>) = exp(— J Q G p (s, w)ds) for some 
function G p which is in 2^(0, T) with probability one. Let D" C D' with d := dist(Z?", dD') > 0. Then for 
every number q > 1 satisfying 



q < min 



f n 
{p- 



1 



2a-4 



T 2 



< 



1 



1 - 



all initial values uq 6 W 1,2q (D, M. ), and every k G {1, 



n(q - 2) 
. , n} there holds 



(4.10) 



sup E 

\h\<d 



sup \\Y«W q {A k , h u)\\l. {D „ 
- te(o,T) 



\Y«DW q (A k , h u)\\ 2 L2{DII) dt 



< c 



(\\W q (D k u )\\l HD) + l + E[\\f H (s) 



\L"-(D T )_ 



for Y q : [0, T] x ft — > (0, 1] given by Y q (t, ui) = exp(— J Q G q (s, ui)ds) for some function G q which is in L 1 ^, T) 
with probability one, and a constant c depending only on n,p, D,T, L, Ln,d, Kj v ' ; and C p . 

Remarks 4.6. In the case of additive noise (with Lh = 0) the second condition (4.10) for the restriction 
on the integrability exponent q reduces to the inequality q < j^— . For multiplicative noise instead, the 
right-hand side in the second inequality (4.10) is decreasing in q (note that for q = 1 it just reproduces the 
condition required in Lemma 4-2) and allows the following interpretation. Obviously, the previous restriction 
q < j— for additive noise remains valid, and in fact the more multiplicative noise is considered (in the 
sense that Lh should not be too small), the smaller will be the maximal integrability exponent which still 
satisfies both inegualities in (4.10). For this reason multiplicative noise might destroy some regularity in form 
of integrability of the gradient Du. 

Moreover, we comment on the scaling of the hypothesis and the assertion with respect to u and Uo, 
respectively, in order to avoid confusion. In view of the definition of W p it is easy to see that Lemma 4-5 is 
stated in a way such that an weighted average of a quadratic quantity in Du gives an information about the 
weighted average of a quadratic quantity in A k .hU. In this sense, the scaling is the natural one. 

Proof. We now follow the line of arguments from the proof of Lemma 4.2 (and of Lemma 4.1), but this time 
we will estimate powers of the difference quotients A k ,hU- 

Step 1. We consider k G {l,...,n} arbitrary, del with \h\ < d, and 77 G C co (D', [0, 1]) a standard 
cut-off function satisfying r\ = 1 on D" (<= D' and |_D?7| < c(d). We first observe that, by the integrability 
assumption on Du and the integrability assumption on G p (which implies strict positivity of inf t6 [ 0i T] Y p f° r 
P-almost every w), with probability one we have 



u G L 



2p(- 



' (D' T , R N ) n L£ c (0, T- L 2p ^ (D', R N )) 



Furthermore, due to the restriction on q, it is guaranteed that W q (Du) belongs to L 2 locally on D' T with 
probability one. For almost every (fixed) x G D we first consider finite differences in direction e k and stepsize 
h of the differential equation (2.3), i.e. 



d-qi A k , h u(x,t) = i]i div A kth A{x,t,u, Du) dt + rj* A k , h H(x,t, Du) dB t 
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in (0,T) for q > 1. We next introduce (because of technical reasons) for K > the approximating function 
Tq^xi') of class C 2 according to Lemma 3.11, and we recall that T q ^K satisfies in particular the polynomial 
growth conditions T q>K {t) = t 2q for all t < K and T qJi (t) < c(q)K 2q - 2 t 2 for all t e R. Employing the 
one-dimensional Ito formula (note that div A(x, t, u, Du) is as a composition of ^-adapted functions again 
Jt-adapted) from Theorem 3.1, applied with g(t,u(x,t)) = r] 2 T q ^(\ A k ^u(x, t)\), we obtain the identity 

d(r] 2 T q , K (\A k , h u(x,t)\)) 

= rfT' K (\ A k>h u(x, t)\)\A k<h u(x, t)\~ 1 ( A k;h u(x, t), div A kJl A(x, t, u, Du) ) r n dt 

+ \v 2 [^j f (|A fc , h u(i > t)|)|Afc, h u(i > t)|-T; x (|A fcih u(a:,t)|)] 

x \A k)h u(x,t)\- 3 \(A k>h u(x,t),A k>h H(x,t,Du))\ 2 dt 

+ \v 2 T^ K {\A kJl u(x,t)\)\A k j l u(x,t)\- 1 \A k , h H{x,t,Du)\ 2 dt 

+ V 2 rg,x(|Afe,/ 1 w(2;,t)|)|A fc j 1 w(x,t)|- :L (A kJl u(x,t), A kJl H(x,t, Du)dB t ) R «. 

In order to prove the assertion of the lemma, we start with a simple observation concerning the terms involving 
A ki hH(x, t, Du). Taking into account the properties of the function T q> K, see Lemma 3.11, we estimate 

[T£ K {\A kth u(x,t)\)\A k . h u(x,t)\ -T' q>K {\A k>h u(x,t)\)] \A k , h u(x,t)\- 3 \(A k . h u(x,t),A kJl H(x,t,Du))\ 2 

< 2{q-l)T' q>K {\A k , h u{x,t)\) lAfc^x^r 1 \A k>h H(x, t, Du)\ 2 . 

Wc next introduce the abbreviation 

for all £ S WL N , and we note U(£) = T' qK (\^\). Now we integrate over x £ D, and then we apply Fubini 
which due to the truncation procedure is always allowed, see Lemma 3.11 ii). Applying the integration by 
parts formula, we hence obtain 



\{T qtK \A k>h u(t)\^r l \\ L2{D) + J (D(V(A kih u{ s )) v 2 ),A kih A(;s,u,Du)) L 2 {D) ds 

< \\(Tq, K \Ak,hUo\)*y\\ L 2( D ) 

+ {q-2~ 1 ) I \\T' q , K {\A k , h u{s)\)^ \A kA u(s)\-i A kJl H(-,s, Du) n\^ (D) ds 



o 



+ / {V(A k , h u(x,s))tf,A kth H(;s,Du)dB s ) L 2 W (4.11) 
Jo 

Now the second term on the left-hand side of this inequality shall be estimated. Using the decomposition 
introduced in (4.6) and applying Lemma 3.11, we first find for every e > 0: 

(D(V(A k , h u(s))v 2 ),A(h)) LHD) 

= k- 1 (D(V(A k j lU (s)) V 2 ) , DA kJl u) L 2 {D) - K - 1 (£>(y(A fc , h u(a;,3)) 77 2 ),£>A fc , h u-Kyi(/i)) L 2 (£)) 

> K- 1 || J D(y(A fej7l u(s))) ■ DA k , h u V 2 \\ LHD) - 2e \\T qJ< (\A kM .u\)? lA^u]-' DA kih ur]\\ 2 L ^ D) 
- c{q, k, e) III A fc)h «|« D V \\ 2 L2(D) - k- 1 (1 - u 2 )i \\D(V(A kM u(s))) \DA k , h u\ rf\\ L i {D) 

> (k- 1 n*(q) - k- 1 (1 - v 2 )^ - 2s) \\T' qK (\A kth u\)^ |A fc , h u|-* DA kJl u r,\\ 2 L 2{D) 
-c(q,K,e, \\Dr]\\ L ~. iD) ) \\W q {A k:h u)\\ 2 L2{DI) . 

We observe from the definition of fi(q) and the second bound in (4.10) on q that the factor /i 5 (g) — (1 — 
v 2 )^ appearing in the previous inequality is always strictly positive. Now, for the second term in the 
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decomposition (4.6) we obtain via the inequalities T' q ' K (t)t 2 < c(q)T' q K (t)t < c{q)T q ^K{t) on K + and the 
Sobolev-Poincare embedding (applied on every time-slice) : 

\(D(V(A k , h u(s))v 2 ),H(h))mD)\ 

< c(L,q) (\\T^ K {\A k , h u\)i \A k!h u\-?DA k!h u V \\ L 2( D) + \\T' q>K {\A k , h u\)% \A k , h u\? Dr]\\ L 2 (D) ) 

x ||T; jc (|A fc)/ ,«|)»|A fc)h t t |-iS(/i) )7 || L3(D) 

< c(L,q) (\\T^ K (\A kth u\)? \A k<h u\-iDA kih ur)\\ L 2 (D) + ||T, lA: (|A fc|fc «|)» Dri\\ L2{D) ) 

x ||3^ iJS -(|A fc) h«|)' |A Jbifc u|-*A fclh u»7||* ||^,jr(|Ajfc,fcu|)s |A fclh ur*A fcl hU»7|lk ( % 

x(p«n*k +h\\; h t +11/11^^)) 

L ("+2)8 (spt rj) L^(sptrj) L.«[,U)/ 

< (\\r giK (\A hth u\)i \A k , h u\-^DA k j l ur 1 \\ 1 + e {D) + \\T qtK (\A kthU \)i ^||[+%) 

x c(n,D,T,L, q ) \\T q M\A k:h u\f> ,||-% (ll^llj^^ + Nl*, (sptj?) + ll/ll.f <„,) 

for every € (0, 1). We now choose = max{p _1 (^2) 2 , J}, for which the last expression in brackets of the 
previous inequality is consequently bounded with probability one, according to the integrability assumptions 
on /, Du and the consequences on the integrability of u explained at the beginning of the proof. Young's 
inequality then implies 

| (D(V(A kih u{x, s)) if) , -B(h) ) L 2 {D) | <e||T^ x (|A Jk ,| l «|)* \A kth u\-^DA k>h ur]f L 2 {D) 

+ c(n, D, T,L,g,||i3» 7 ||ioo (D) ,e)(||T, > j < r(|A fc) hu|)5»7||l a(D) + l) 

x (1 + H^H^ (aptT?) + Il-H 2 /^^^^ + ll/ll-C-)) ■ 

Finally, via the bounds for q in terms of n,p,a and v, the last term in the decomposition involving G(h) is 
estimated with Young's inequality and the well-known estimates for finite difference quotients by 

\(D(V(A kth u(s))r, 2 ),G(h)) L 2 (D) \ 
<c(\\T qiK {\A k!h u\)i\A kih u\-^DA kih ur]\\L2 iD) + \\T q , K {\ A kih u\)i D v \\ L 2 iD) ) 
x ||2^(|A fc)fc u|)*|A fclh «r* e(h) v \\ L 2 {D) 

< e \\T q K (\A kJl u\)i \A kih u\-iDA k>h ur]\\l 2{D) 

+ c(D,T,L,q,\\Dr 1 \\ L oo {D) ,e) (l + \\Du\f P 2 % + \\uf P ^ + \\f\\ a L « (D) ) 

n (spt?)) L'^ y n ' (spt 77) 

provided that Aq < a. For the general case, one again has to argue more subtle, using the Sobolev embedding 
on time slices as for the term with 23 (h). With the analogous calculations as before this yields 

\(D(V(A k>h u(s)) V 2 ),C(h)) L 2 {D) \ 

< e H^flAfc.hul)* |Afc lfc «|-JDA fclfc u» 7 ||i a(D) 

+ c(n,A2 , J i > g,||I)»7|U-(D),e)(||T g ,j f (|A fcih «|)»»/||i a{D) + l) 

x ( i+ ''K p s ( spt7, + '' M '':t3^spt,) + '' / ''^))- 



It now still remains to handle the second term on the right-hand side of inequality (4.11). With the assump- 
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tions (2.2) on H and Young's inequality, we easily find 

i i 1 1 2 

\\T^ K (|A fci / l u(s)|)5 |A fcj/l ii(s)|~5 A fcih i7(-,s,Du)?7|| i2(£1) 
< {L 2 H + e)\\T^ K {\/\ Kh u\) h |A fcjfc u|-'DA fclh «»/||i a(J , ) 
+ c(L, W) (1 + \\Duf^% + ||/ H ||J. (iJ) ) • 

n (apt 77) 

For every s £ (0, T) we now define 

G"(u, f)(s) := ^G p (a) + 1 + ll^ll^^ + IMI^Jg,^ + , (4.12) 

which is a L 1 (0,T) with probability one. Furthermore, we set G q := ^c"G"(u, f) > G p which immediately 
gives Y q < Y p . Then, taking into account the smallness condition (4.10), choosing e sufficiently small and 
combining the previous estimates for the various terms arising in (4.11), we find a preliminary (though still 
-fT-depending) pathwise estimate 



\\Tq tK (\A kth u(t)\)iri\\ 2 LHD) +c l {L H ,n,v) I \\T q K (\A kJl u\)? \A kM .u\ * DA k . h ur]\\l 2{D) ds 

Jo 

< \\T 9 ^{\A k ^uo\)h\\h(D) + c " I {\\T q A\^hu{t)\f^f L ,, D) + 1) G"(u, f) ds 

Jo 

+ cf \\f H (s)\\ a La(D) ds+ f (V(A k . h u{x,tj)7f 7 A k j l H{-,s : Du)dB s ) L 2 {D) . 



Step 2. We may now apply in a first step Ito's formula in exactly the same way as before in the derivation 
of estimate (4.9): 



e-So-''^'^f)^\\T q!K (\A^ h u(t)\)h\\l 2(L 



; o|)Mlr,2fm +l + c J \\fH(s)\\ a La( 



+ c [ e- J? c " G " (u '^ dS ( l/(A fe . hU (x, t)) ?? 2 , A kth H(-, s, Du) dB s ) LHD) . 
Jo 

Step 3. Similarly to the proof of Lemma 4.2, we introduce the random time 

inf ji e [0, T] : J || \A k ^ h u(s)\ 2q ~ 1 rj 2 \A k j l H (•, s, Du)\ H^i^ ds > i?| 



with tr = T when the set is empty. Differently from Lemma 4.2, the property 

r-T 



< OO = 1 



P(J || \A k!h u(s)\ 2q - 1 v 2 \A k , h H (., s, Du)| I)' ds 

which is needed to have P (limjj_ ! . 0O tr = T) = 1 is not clear a priori. We shall prove it a posteriori. 
Notice that, by Lemma 3.11, 

t/\TR p 2 

e -a/ 'c c («,/)«« ^ j^\v(A k , h u(x, S ))\r} 2 \A k , h H(s, Du)\dx) ds 

t/\TR 



< J \\\A k , h u(s)\ 2q -\ 2 \A k , h H(-,s,Du)\\\ L1{D) ds<R. 
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Step 3a. The last calculation shows that the stochastic integral from Step 2, stopped at tr, is a martingale 
(and thus it has zero expectation). Therefore (as in Lemma 4.2) 



E 



e~ti" R c '' G ''^ ds \\T q!K (\A k , h u(tAT R )\)^\\l 2{D 



c- x E 



tf\T R 



■^^'^'(«.fl« IS ||r; jf (|A fc , h «|)*|Afc lfc «|-4 J DA fclh tt»7||' a(i , ) 



ds 



I 1 1 2 



< \\T qiK (\A k MU \) 2 ri\\ L2{D) + l + c 



E 



\Ms)\\ La{D) ds 



At this stage we may pass to the limit K — > oo via Fatou's Lemma on the left-hand side and monotone 
convergence on the right-hand side, and we obtain 



E 



tf\T R 

c-'E 



|2 



e -/J «"G"(«,/)«wi| |A fclh tt(s)[«- 1 J DA fc>fc «(s)f 7 |i:, — ds 



i2 

li=(Z3) 



< ||W g (A fei/l ito) v\\ L 2 {D) + 1 + c 



|/»(*)|| £ . (IJ) ds 



(4.13) 



Step 3b. Next we apply Burkholdcr-Davis-Gundy inequality to the inequality above stopped at tr, 
raised to the power -. Taking the limit K — > oo as in (4.13), we get 



9 

1 f tAT H 



sup e 

te[o,T] 



W q (A kM u{t A t r )) r/\\ q L2{D) 



< \\W q (A kth u )n\\l HD) + l + cE 



\fH(s)\\ La(D) ds 



CE 



e -2/o*^''(«>/)««|||A fc)llU (a)| 2 «-V|A fc)A fr(^«,I)u)||i; w „, 



i2 



Since due to Holder's inequality we have 



||A fe , hU ( s )i 2 «-yiA fe , h ff(-, s,^)! 



I £!(£>) 
19-1. 



< |||A fe ,, u ( s )|%|| 

the last term of the previous inequality, similarly to the proof of Lemma 4.2, is bounded by 

rl/2 7-1/2-1 



< \E[h] + ^E[I 2 ] 



where 



h= ^p e-U^ Rc '' G "^ dS \\W g (A kth u(tAT R ))4i 2(D) , 
te[o,T] ' 

{•TAtr I 

e -/«c"G"(«,/) ds -|| |A MiU ( s) |9- 1 ?? |A fc ,, J ff(., s , J Du)|||' 2(I)) ds)' 



Hence, we have proved that 



-E[h] < \\W q (A k>h u ) V \ 



1 + cE 



\f H (s)\\ LaiD) ds 



r 2 
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Now, by the assumptions (2.2) on H, Young's inequality and the bound on q, we have 



C 2 

— E[I 2 ] <CE 



TAT R 



;-^c"G"(«./)«|||A fcifc «( fl )|'- 1 DA fclh «(«)»7| 



L 2 (D) 



ds 



C + CE 



+ CE 



\Ms)\\ LHD) ds 



e - f- c"G" (u,f)d~s 1 1 Wq {Du{g) ) 1 1 2 ^ 



We observe that the last term remains bounded, due to the assumption of the lemma on the average and the 
choice of G"(u, /) (which ensures that c"G"(u, f) > 2qG p ). Thus, by inequality (4.13) proved above, we find 



'rE[I 1 ]<c\\W q (A k ,hUo)v\\hlD)+ C + cE I Vh(s) 



\L-{D) 



ds 



with a new constant. 

Step 3c. In Step 3a and Step 3b we almost proved the two bounds claimed by the lemma since the 
previous inequality along with (4.13) gives us 



E 



sup e i 
te[o,x] 

■TAT R 



c '' G '' {uJ)ds \\W q (A k , h u(t Ar R )) v \\l 2 



L 2 (D) 



+ E 



,- £ c"G"(u ,f)ds II | A U ( fl )|9-1 DA k , hU (s) T}\ 



I) 



< c [\\W q (AkMUo)r]\\ L2{D) +c + c 



E 



\Ms)\ 



L°(D) 



\L*(D) 

ds 



ds 



It now remains to justify (as already observed above) the limit tr — > T as R — > oo with probability one. 
Indeed, since R <— >■ tr is non-decreasing and bounded above by T, there exists the a.s. limit 

t := lim tr 

R— >oo 

and t(uj) G [0,T]. By Fatou's lemma and monotone convergence, 
E 



( sup e-f« c " G "^ ds \\W q (A k , h u(t))v\ 

X te[0,r] 



2 

L 2 (D) 



+ J e-^ c '' G ''^ ds l|A feihU ( s )|5- 1 Z»A fei/lU ( s )r ? ||' 2p) d S 

is finite, hence the argument of the expectation is finite with probability one. Since J Q T c!'G"(u, f) ds is finite 
with probability one, we get 



sup \\W q (A k ,hu(t))r]\\ 2( .+ / |||A fc ,hM(s)| 9 1 DA kjh u(s) r]\\ , 2 . „, 
te[o,r] v ; Jo ; 

with probability one. Thus (with the same inequalities used above) 



||A fc , h ti(«)| a «-V|A fc , h ir(- > s J i?u)|||' 1(D) ds 

<c(l+ sup \\W q (A k , h u(t)) v \\ 2 + f |||A fclh u(s)|«- 1 .DA fcl , l u(s)» 7 
v te[o.r] v ; Jo 



(is < oo 



\L 2 (D) 



+ / \\W q (Du(s))\\ 2 L2{DI) ds+ / ||/ff(*)||£ 0(D) d«) 



f/.s 
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with probability one. If t(oj) < T, by definition of tr we have 

J || |A Ml u(s)| 29 ~V \&k,hH(; s, Du)\ f Ll{D) ds = oo 

which is false, hence P(r = T) = 1. Having this basic fact, the same estimates just proved give us the result 
of the lemma, by taking into account the inequality \DW(A^i l u)\ < q\ Aj. ^u^^ 1 \DA^ ^u\ and the definition 
of G q (and hence of Y q ) given after (4.12). □ 



5 Proof of the regularity result 

Having the previous lemma at hand, we may now proceed to our main result. 

Theorem 5.1. Let u be a weak solution to the initial boundary value problem to (2.3) with initial values 
u{ ■ , 0) = ito(-) G W 1 ' a ~ 2 (D, Bi N ). Assume further the assumptions (2.1) with v > (n — 2) / 'n such that 

*<™ , w-^nj([ 1 -( ! T 2 ) , l 4 -[ 1 -^]')- 

Then there exists a > depending only on n, v and a such that for every subset D c <s D we have 

P{\mC°.«(D c x[0,T],W) < Oo) = 1. 

Proof. To prove the result, we want to apply Proposition 3.9. Therefore, the crucial point is to show higher 
integrability of Du for "great" powers with probability one, in order that hypothesis (3.6) of the proposition 
is satisfied. We start by defining a sequence 

Qo ~ 1 , 

r n + 2 n+2a-A a — 2 i . 

q j+1 := mm \ qj , 1 + q i , — — , q d + 1 \ for j > 1 . 

I n n a 2 i 

Before defining a further sequence (qj) in order to perform the iteration, we make some observations on L* H (s) 
as a function in s 6 [1,2/(1 — v)] (we note that L* H (2q) already appeared in hypothesis (4.10) which gave an 
upper bound for q in the iteration). Clearly, L* H (s) is strictly decreasing in s, with L* H (2/(1 — v)) = 0. 

We now set qj = qj as long as qj > qj-i and L* H (2qj) > Lh, and for the first index j which doesn't satisfy 
these assumptions any more we set qj = q* for a number q* > n/2 (which is determined below). In what 
follows we shall denote this set of indices by J C N . We first study some properties of the sequence q and 
give a definition of the final member q* of the sequence (qj)j € j: the first and the forth term in the rewritten 
formula for q are strictly increasing in j and diverge for j — > oo, whereas the the monotonicity properties of 
the second term depend on both the values of a and the size of qj. More precisely, if a > 2(n + 2), then the 
second term increases with j and diverges for j — > oo, but for every a £ (n + 2, 2(n + 2)) it increases only up 
to qmaxia, n) = na/(A(n + 2) — 2a) > n/2. Observing L* H (n) > Lh by assumption, we thus define 

q* := arbitrary number in Q, min j {L^y 1 (L H ), - 2 ,q ma x}^ ■ 

It is easy to calculate that this number q* is reached after a finite number of steps (depending only on n, v 1 a 
and the difference q m ax — q* (in the sense that the number of steps diverges as q* /*• q max ), hence \J\ < oo, 
i.e. (qj)jeJ is a finite sequence. 

We are now going to establish by induction that for every j £ J we have 

(i) sup E[\\Y$ W q y/\ kJl u)\\ 2 ^ } < Cj for all fc G {l,...,n}, 

\h\<dist(D j ,dD j . 1 ) 3 y ' " 

(ii) sup E[\\Y q JW qj (A k>h u)\\ 2/ 2 9 ^ J <c{n,D 3 )C 3 for all fee {l,...,n}, 

(in) E\\\Y^W Q .(Du)\\ 2/q y 2 }<C j , 



■(L> J x(0,T)) J 



(iv) Du £ L°°(0, T; L 2qj (D j7 R nN )) with probability one. 
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Here (Y qj )j £ j is a sequence of random variables given by Y qj (t,uj) = cxp(— L G qj (s,uj)ds) for each j G J, 
for a sequence of functions (G qj )j£j which are in L 1 (0, T) with probability one and which will be determined 
later, and (Dj)j £ j is a monotone decreasing sequence of open sets satisfying D c C Dj C -Dj-i C . . . C -Do C 
D_i = D. 

We start by setting 

Y 1 ;= f° c'G'(u,f)ds 

where G'(u,f) was defined in (4.7). It is obvious from its definition that Y\ \ [0, T] x f2 — >• (0,1] satisfies 
P(inftgfo j x] ^i > 0) = 1. We then observe from Lemma 4.2 that 

sup 

|fe|<d 



sup ||yiA fc , h u||£ a(Do) + / ||yi£)A fcih u||i a(Do) dt 
te(o,T) Jo 



< ^(||£> fc «o||i a(IJ) + 1 + £[||// a ll!^)]) == Co 



is satisfied for every open set Dq compactly supported in D. By definition of the space V 2 , this establishes 
the statement (i)o- Furthermore, (ii)o follows immediately from the Sobolev embedding (2.4), applied for 
P-almost every uj to the functions Y\ Ak,hu, for k G {1, . . . , n}. To conclude the first step of the iteration it 
only remains to justify the statements (iii)o and (iv)o- To this end we take advantage of Theorem 3.6 twice, 
in the way as explained in Remark 3.7 (and actually as already performed in Corollary 4.4). First we apply it 
with the choices p = q = 2qo^^- to the inequality from (ii)o (for all k G {1, . . . , n}), leading to the existence 
of Du in the Lebesgue space L 2 (™ +2 )/"(Z?o x (0,T),M. nN ) with the required estimate for the average of Y\Du\ 
secondly, we apply it with the choice p = 2qo and q = oo to (i)o - more precisely to the first term in the 
V 2 -norm - and, keeping in mind the pathwise strict positivity of Yi, we end up with the existence of Du in 
L oo (0,T;L 2 (D ,IR nJV )) with probability one. 

We now proceed to the inductive step. Assume for a given j € J that (i)e—(iv)e are valid on open sets 
Df C Dt-\ with random variables Y qe : [0, T] x CI — > (0, 1] of the required form for all i G {0, . . . , j — 1}. 
Then, keeping in mind (iii) 3 _i and the definition of the number q* , we note that the assumptions of Lemma 
4.5 are satisfied (for p, D' replaced by qj—i, Dj—i), and we hence deduce (with the admissible choice q = qj) 
the estimate 

sup E 

\h\<dj 



( sup \\Y% W qj (A k , h u)\\l HD , + / \\Yg DW qj (A fclh u)||i a(Dj) dt) ' 
v te(o,T) Jo ' ' 

<c(||^ J _ 1 p fcU (x ) 0))|| 2 2(D) + l + ^[||/^( S ))||J o( ^ ) ]) 



for every k G {1, . . . , n}, a domain Dj C Dj_i satisfying dj := dist(Dj, dDj_i) > and a random variable 
Y qj defined via G qj given in Lemma 4.5 and satisfying in particular P(inf te [ .T] Y qj > 0) = 1. This shows 
(i)j, and (ii)^ in turn is an immediate consequence after the application of the Sobolev embedding as above. 
Moreover, the statements (iii)j and (iv)j again follow from (ii)^ and (i)j, respectively, after the application 
of Theorem 3.6 with the choices p = q = 2qj^^- and p = 2qj, q = oo, respectively. This finishes the proof of 
the induction. 

As an immediate consequence of the induction, we can now conclude the desired higher integrability result 
to a great power, via the following observation. Via (iv) we find in the limit 

Du e L°°(0,r;i 29 *(Li c ,M' iAr )) 

with probability one, and by definition the exponent 2q* is greater than the space dimension n. Hence, 
assumption (3.6) of Proposition 3.9 is guaranteed. For its application we still need to check the integrability 
condition on a(x, s), b(x, s) given by 

a(x, s) := div A[x, s, u, Du) and b{x,s) := H(x,s,Du) . 

Since A(x,t, u, z) is differentiable in x, u, and z with bounds (2.1), we obtain a G L 2 (D C x (OjT),]]^) 
with probability one as a direct consequence of Du G V 2 (D C x (0,T),R nN ) and / G L a (D T ) C L A {D T ). 
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Furthermore, the growth of H according to (2.2) with f H G L a (D T x 0) implies 6 G L 2+£ (0, T; L 2 (L> C , R™'^) 
with probability one. Thus, Proposition 3.9 yields the asserted Holder continuity of u with probability one 
and finishes the proof of the theorem. □ 



6 Regularity of the average due to noise 

It has been recently proved that a Stratonovich bilinear multiplicative noise may have a regularizing effect 
on certain classes of PDEs, see [8] for a review, based on a number of works including [9, 11, 1]. In most 
cases, uniqueness by noise is the topic of these works. The problem of the interaction between noise and 
singularities is more difficult and less explored. But two examples are known: 

(i) for linear transport equations of the form 

du = (b(x, t) ■ Du) dt + aDu o dB t 

with b G C(0, T; C£*(R n , K™)), where regular initial condition may develop discontinuities in finite time 
in the case a = (think of the simple example in dimension n = 1 given by b(x) = — 8ign(x) s/\x\) , it 
is known that C 1 -smoothness is preserved for a ^ 0, see [9, 10], where similar results have been also 
proved for linear continuity equations; 

(ii) for the point vortex motion associated to the 2D Euler equations, it has been proved that coalescence 
of vortices cannot happen when a suitable Stratonovich bilinear multiplicative noise is added to the 
equations, see [11]. 

One should also notice that other singularities, like those arising in the inviscid Burgers equation, do not 
disappear under noise, see [8], so each equation requires its own understanding and investigation. Moreover, 
no general method exists to investigate these kind of properties. 

Our aim here is to give a simple partial result in this direction (namely the effect of noise on singularities) 
for linear systems. We consider the linear stochastic system with Stratonovich bilinear multiplicative noise 
of the form 

du = div (A(x, t)Du) dt + aDu a dBt , u\t=o = uq (6-1) 

with bounded measurable coefficient matrix A, where Bt is a Brownian motion in K n , defined on a filtered 
probability space (fl,Ft,P). The space variable x varies in a possibly unbounded regular open domain 
D C K™. On A we assume that there exist Ao, Ai > such that 

AolCI 2 < {A(x,t)t,£) and \A(x, t) £| < Ai|£| (6.2) 

for all £ G M. nN , a. e. (x,t) eflx [0,T]. Actually, this is analogous to assumption (2.1)2 (then v corresponds 
to the ratio rewritten for vector fields which are linear in the gradient variable. We further note that 
for now we do not assume any regularity with respect to x, but at the same time we do not allow any 
dependency on f2. Let us clarify the vector notation used in the stochastic part: aDu(x,t) o dB t is a vector 
with N components, and 

n 

(<tDu(x, t) o dB t ) a = a D lU a (x,t)o dB\ . 
t=i 

Remark 6.1. Let us recall that Stratonovich noise is the natural one for modelling: the so called Wong-Zakai 
principle, proved for several classes of SPDEs (see for instance the appendix of [9] for the linear transport 
equation), states that solutions u n (x,t) of deterministic equations with smooth random coefficients B n (t) of 
the form 

= div (A(x,t)Du n ) + aDu n * , u n |t=o = «o 

converge (in proper topologies and under proper assumptions on B n , the details depend on the problem and 
result) to solutions u of the previous SPDE with Stratonovich noise (not ltd noise). We have stated the 
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principle for our system of parabolic equations just for sake of definiteness, but in fact it has not been proved 
before in this generality. We do not want to give a proof here, which would require a considerable work. We 
only quote this fact by analogy with other equations, as a general motivation for the choice of Stratonovich 
noise. 

Let us give the definition of weak solution to equation (6.1), similarly to [9]. To understand one of the 
requirements (the fact that s i— > J D u(x, s)Dtp(x) dx must have a modification which is a continuous adapted 
semi-martingale), we recall a few facts about Stratonovich stochastic integrals, taken for instance from [17]. If 
B t is a (f2, F t , P)-Brownian motion in K n and X (t) is a continuous ^-adapted semi-martingale, the following 
uniform-in-time limit exists in probability 



f\( \ a r r V" X{t l+l At)+X(U) ^ 
/ X(s)odB s = hm 2^ o ( B U+U 

JO n->oo 2 



ti£7r„ ,U<t 

and is called Stratonovich integral of X with respect to B. Here 7r„ is a sequence of finite partitions of 
[0, T] with size 7r„| — > and elements = to < t\ < .... Under the same assumptions it is defined the joint 
quadratic variation between X and B: 

[X,B} t = lim V {X(t i+1 At)-X(ti))(B ti+lM -B ti ), 

n— >oo * — ' 

ti^TV n .ti<t 

and they are related to the ltd integral 



/ X(s)dB s = lim V XMiB^t-Bu) 



(which is defined under more general assumptions on X) by the formula 



f X{s)odB s = f X{s)dB s + \[X,B] t 
Jo Jo 1 



Definition 6.2. If u 6 Lf oc (D,R N ), we say that a random field u(x,t) is a weak solution of equation (6.1) 
if: 

(i) with probability one, we have u £ V 2 (Bt,1& n ) for all bounded open sets B C D, where Bt = Bx (0,T), 

(ii) for all tp £ Cq°(D, R n ), the K n -valued process s t— > J u(x, s)Dip(x) dx has a modification which is a 
continuous adapted semi-martingale, and for all t £ [0,T], we have P-a. s. 

u(x,t) ip(x) dx + / / A(x, s)Du(x, s)Dip(x) dxds + a / I / u(x, s)Dip(x) dx) o dB s 
Jo Jd Jo ^Jd ' 

uo(x)ip(x) dx . 



s i — ^ J D u{x, s)(p(x) dx has a continuous modification. We shall always use it. Notice further that we give the 
g meaning to the vector notation above: 

n N 



A posteriori, from the equation itself, it follows that for all (p £ (D, W N ) the real-valued process 
> J D u(x, s)(p(x) dx has a continuous modifica 
following meaning to the vector notation above: 

[ ([ u(x, s) Dip(x) dx) o dB s = V V / (/ u a (x,s)D l ip a (x)dx) odB l s 
Jo ^Jd ' i=la=1 Jo ^Jd ' 

Proposition 6.3. A weak solution in the previous Stratonovich sense satisfies the ltd equation 

u(x,t) (p(x) dx + / / A(x, s)Du{x, s)Dip(x) dxds + a / I / u(x, s)Dip(x) dx] dB s 
Jo Jd Jo ^Jd ' 

= / uo(x)(p(x) dx + — / / u(x, s)Aip(x) dx ds 
Jd 2 J J D 
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for all ip £ (DjK^). The converse is also true. With a language similar to that of Definition 6.2, we 
could say that u is a weak solution of the ltd equation 



2 

du = div ^A(x,t) + —^jDiij dt + aDudB t , w|t=o = uq . 
Proof. From the facts recalled above about Stratonovich integrals we have 

' [ u a {x,s)D l i P a {x)dx] odB\ 
o k Jd ' 



(6.3) 



v JD 
1 



u a (x,s) D z (p a (x)dxj dB 
u a (x,-)D l tp a (x) dx,B 

* L JD 

Hence, we get 

i(x,t) <p(x) dx + j J A(x,s) Du(x,s) Dip(x)dxds + a J (^J u(x, s) Dtp(x) dx^j 

71 N „ 

u (x) <p(x) Ac ~ £ £ £ [ / D iV a {x) dx, B l 

1 i=l Q =l lJ D 



dB s 



D 



By the equation in Definition 6.2 we also have 



u(x,t) Diip(x) dx + / / A(x, s) Du(x, s) DDiip(x) dx ds 
'o Jd 



D 



u (x) DiLp(x) dx — a J J u(x, s) DDi<p(x) dx^J odB s 



Moreover, recall that 

ri 



J ^ J u(x, s) DDiip(x) dx^j odB s = J (^J u(x, s) DjDi(p(x) dx^J o dBl . 
Thus, by the classical rules about quadratic variation, see [17], we have 

N r r 

X [ _/ u a {x,-)D i ip ol (x)dx,B i = J u(x,-)D i tp(x)dx,B l 



u(x, s) DiDiip(x) dx J ds 



a=l 



J D 

The proof that the Stratonovich equation yields the Ito one is complete, and the proof of the converse 
statement is the same (recall that the existence of the continuous modification in (ii) of Definition 6.2 follows 
immediately from the equation in Definition 2.2). □ 

The degree of parabolicity of the Ito SPDE (6.3) is the same as the one of (6.1), it is given just by the 

2 

properties of A(x,t). The term ^-Au(x,t)dt is fully compensated by the Ito term o~Du(x,t)dB t and does 
not contribute to any additional parabolicity. This is a well recognized phenomenon in the theory of SPDEs, 
see for instance [16]. A simple way to see this fact is to consider the case A = 0. 



Proposition 6.4. Consider the equation 

du = Du o dBt, u\t=o = uq 



(6.4) 



in the full space D — R™, where B is an n-dimensional Brownian motion and u:Dx[0,T]x!l-> M. N . This 
is equivalent (when formulated in a weak sense) to the equation 



du = -Audi + DudB t , M|t=o = uo- 



(6.5) 
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Assume u e L 2 (D,R N ). Then 

u(x, t) = uq(x + B t ) 

is a weak solution, in the sense that 
(V) with probability one, we have J Q T J Rn \u(x, t)\ 2 dx dt < oo, 
(W) condition (ii) of Definition 6.2 hold true. 
Proof. Condition (i') comes from 

/ / \u(x, t)\ 2 dx dt = / / \u (x + B t )\ 2 dx dt = / / \u (x)\ 2 dx dt < oo . 
Jo JK" Jo Jts." Jo Jts." 

Condition (ii) of Definition 6.2 is due to the following argument. For every ip S (M™ , R N ) we have 
u{x,t)ip(x) dx = / uq(x + B t )ip(x) dx = / uo(x)ip(x — B t ) dx 

iE" Jr" 

and ijj(x — B t ) is the semi-martingale 

4>(x-Bt) =^{x) 
Consequently, we obtain 

u(x, t)ip(x) dx = / uq(x)%Ij(x) dx — I uq{x)Di[){x — B s ) dx^j dB s 

Uq(x) Aip(x — B s ) dx) ds , 



/ Dip(x - B s )dB s + \ [ Aip(x-B s )ds. 
Jo 2 J 



which shows that the stochastic process s i— >■ J Rn u(x, s)ip(x) dx has a modification which is a continuous 
adapted semi-martingale. In addition, this computation may also be used to prove the equivalence with the 
ltd formulation (6.5). Finally, 

u(x,t)(p(x)dx — / uo(x)(p(x)dx + / I / u(x, s)Dip(x) dx ) odB s 

JR" Jo W]R" ' 

Uo(x)ip(x — B t ) dx — / u a (x)(p(x) dx + / I / uo(x)Dip(x — B s ) dx) o dB s , 

JR" JO Wl» ' 

and this is equal to zero because 

ip(x — B t ) = <p(x) — / Dip(x — B s ) o dB s . 
Jo 

Thus, also condition (iii) is satisfied, and the proof is complete. □ 

Remark 6.5. The previous proposition shows that the ltd equation (6.5) has no regularizing properties, in 
spite of the presence of the term ^Ait (it is fully compensated by the ltd term). In particular, if uo = l^^O; 
the solution u(x,t) — l g i <Xl is discontinuous in x for every given (t,Lu). At the same time 

E[u{x, t)} = E[uo{x + B t )} = P{Bl < xi) 

is smooth. This means, there are easy examples of a weak solution which have a smooth average, but which 
are irregular with probability one. Thus, smoothness of E[u(x, t)] does not imply smoothness of u(x,t), and 
so in general the regularity of E[u(x,t)] is not enough to hope for regularity of u(x,t) itself. However, it 
is important to observe that this example started from an irregular initial data, and that this singularity 
was preserved in time. Obviously, the same reasoning applies to see that in fact every solution to (6.4) is 
Holder continuous in Dt if Uq is additionally assumed to be Holder continuous, so in particular in the case 
uo 6 W 1 ' q (D, M. N ) for some q> n (which was always required for the regularity statements before). 
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Now let us come back to weak solutions to the general linear system (6.1) with Stratonovich noise. The 
crucial observation is that the average of u solves an equation with improved parabolicity. Let us first recall 
the classical definition used also before in this paper. If v G Lf oc (D,R N ), we say that a (deterministic) 
function v(x, t) is a weak solution of the parabolic equation 

-£=div((A(x,t) + ?-)Dv), v\ t=Q =v Q (6.6) 



dt \\ v ' ' 2 

if v G V£ c (Dt,1& n ) (in the sense of (i) of Definition 6.2 above) and 

v(x, t)ip(x) dx + / / A(x, s)Dv(x, s)Dip(x) dx ds 
Jo JD 

a 2 f l 

vo(x)tp(x) dx — / / Dv(x, s)D(p(x) dx ds 



o JD 



for all ip G Cq°(D, 



Proposition 6.6. If u is a weak solution of equation (6.1), then 

v(x, t) := E[u(x, t)} 
is a weak solution of the parabolic equation (6.6). 

Proof. Step 1. We first observe that for these linear systems, we have the a priori boundedness of the 
solution u in the sense that 



sup E 

te[o,r] 



\u{x, t)\ 2 dx 



E 



T 

J B 



Du(x, t)\ 2 dx dt 



< oo 



(6.7) 



for all bounded sets B C D. The proof of this property follows the line of arguments of the proof of Lemma 4.1 
(but is in fact much easier). We do not want to go into details, but only mention the peculiarities. First, by 
the linear structure in (6.1) the function Go appearing in Lemma 4.1 can be chosen constant. This explains, 
why the estimate (6.7) doesn't involve weights as before. Furthermore, in Lemma 4.1 we were content with 
a bound for the expected value of the spatial derivatives of u only. However, adjusting the arguments from 
Step 3b in the proof of Lemma 4.2, we obtain a bound for the average for the full y 2 -norm for every bounded 
set B compactly supported in D. This immediately gives (6.7). 

Step 2. The regularity property v e L°°(0, T; Lf oc (D, R N )) is a direct consequence of the first condition 
in (6.7) from Step 1. In order to prove that also v G L 2 (0,T; W^(D,M. N )) holds true, we first observe that 
we have (a. s. in t) 



E 



DiU a {x, s) ip(x) dx 



= -E 


1 







u a (x, s) Diip(x) dx 



for all i[> G Cq°(D,M). This implies (by the integrability derived in (6.7)) 



E[DiU a (x,s)] ^{x)dx 



E[u a (x,s)] Diip(x)dx, 



which in turn gives us that E[u a (x, s)] is weakly differentiable in x with partial derivative equal to E[DiU a (x, s)]. 
Thus, v is weakly differentiable in x and 



T 

JB 



Dv(x,t)\ 2 dxdt 



< 



J B 
T 



JB 



\DE[u(x,t)}\ 2 dxdt 
E[\Du(x,t)\ 2 ] dxdt = E 



J B 
T 



E[Du(x,t)]\ 2 dxdt 
Du(x,t)\ 2 dx dt 



o JB 



< oo 
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for all bounded B C D. The regularity properties of v have been checked. 

Step 3. The property E[J Q J B \u(x, t)\ 2 dx dt] < oo implies that the Ito integral in the equation of 
Proposition 6.3 is a martingale, hence it has zero expected value. By the same assumption, we can interchange 
expectation and integrals, and we get 

v(x,t) ip(x) dx + E / / A(x, s) Du(x, s) Dip(x) dx ds 

'-JO JD 



a 2 



D * JO JD 



uo(x) ip(x) dx + — I j v(x, s) Atp(x) dx ds . 



From the property E[j^ J B \Du(x,t) \ dxdt] < oo and the boundedncss of A it follows that 



E 



JB 

t 



JD 



A(x, s) Du(x, s) D(p(x) dx ds = j / A(x, s) E[Du(x, s)] D(p(x) dx ds. 

o J D 



Since we know from Step 2 that E[Du(x, s)] = Dv(x, s), the proof is complete. □ 

Let us now explain the possibly regularizing effect of noise. Assume a = 0. Then, as already explained 
in the introduction, weak solutions may miss full regularity. One can find in [26] an example of matrix A 
satisfying assumption (6.2) and an example of a weak solution to the associated parabolic system (1.2) such 
that it is Holder continuous on a local time interval and then its L°° norm blows-up. More precisely, this 
matrix turns out to have an ellipticity ratio which is smaller than the critical one employed before, which 
was an essential ingredient in order to obtain globally Holder continuous weak solutions (see [14, 13, 15]). 
However, the matrix constructed by Stara and John [26] also fails to satisfy the regularity with respect to x, 
i. c. the matrix A is not differcntiable in x. For this reason it is not clear whether the counterexample could 
by constructed due to the small ellipticity ratio or the low regularity in i or a combination of both. As far as 
we know there is no counterexample available in the literature which answers this question, and so even in the 
deterministic setting this irregularity phenomenon for weak solutions of parabolic systems is not understood 
completely. Instead, for the elliptic (stationary) case Koshelev was able to give a sharp result, namely that 
(in the linear case considered in this section) full Holder continuity of the weak solution to div(A(x)Du) = 
holds provided that the matrix A is symmetric (for simplicity), measurable, bounded, and satisfies (6.2) with 



Ai + Ac, V n - 1 

The sharpness of this condition follows by a modification of Dc Giorgi's famous counterexample [5], see [15, 
Section 2.5]. Returning to the parabolic setting we now state a consequence from the previous Proposition 6.6, 
which for randomly perturbed systems (6.1) gives a regularity result for the average E[u(x,t)] if the matrix 
is assumed to be regular with respect to x. Since the existence of a deterministic counterexample is not 
clear, the Stratonovich multiplicative noise is only possibly regularizing, but in any case it might be of its 
own interest since the noise improves the parabolicity of the equation solved by the average. 

Proposition 6.7. Assume q > n, A with property (6.2) such that l-D^I is bounded uniformly by some 
constant L > 0, and let D C K" be a bounded, regular domain. Then there exists o~o > such that for all 
a > ao, all initial conditions uo € W l q (D, M. N ), and all weak solutions u of equation (6.1) satisfying (6.7), 
we have that (x,t) i — > E[u{x,t)\ is locally Holder continuous on D x [0, T]. One can take <7q depending only 
on 

Ao 

Proof. The matrix A{x,t) + ^-1 satisfies the assumptions needed for the deterministic regularity results 
in [14, 13, 15], see also Theorem 1.1, for all a greater than some ao which can be defined in terms of This 
implies that any weak solution v of equation (6.6) is locally Holder continuous onDx [0, T]. It is sufficient 
to apply this result to v(x 7 t) = E[u(x,t)]. □ 
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Remark 6.8. Given the ratio j 1 , the result is true for all matrices A with that ratio and all (regular) 
initial conditions. Intuitively speaking it looks impossible that regularization comes from the operation of 
mathematical expectation: it could regularize problems with special symmetries, such that singularities for 
different uj's average out (compare Remark 6.5 for this phenomenon under an irregular initial condition). 
But here A and Uq are quite generic (though regular). Thus we believe that Holder regularization takes place 
at the level of u itself. However, this problem is open. 
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